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LETTER FROM THE EDITOR 


Who Are You® 


- What Did You Say? 


any of you have taken the time to give us your very valuable feedback about how we can con- 
M tinue to improve this Xcell Journal. After all, it is your journal, and its only purpose is to make 
your job easier and more productive, while also providing insights into the trends and technologies that 
are shaping the future of logic design. The overwhelming majority of responses indicated that Xcell is 


a huge success, often read cover to cover, and then saved for later reference. Thank you! 
Here’s some of what we learned from our reader survey: 


¢ Most of you are design/development engineers (74%), doing digital logic design using FPGAs 
(88%) and CPLDs (76%), for industrial (38%), networking (35%), data processing (25%), and 
military (24%) applications, in companies of less than 500 employees (60%). 


¢ Your three most popular categories are technical (“how to”) articles, new product announcements, 
and the product reference guides. (You can bet these sections will grow fatter in future editions.) 


Your least popular category is Customer Success stories. 


¢ Most of you read Xcell Online and would still like to receive the printed journal, monthly — 
almost everyone wanted more information, more often. 


More Information, More Often 


If you haven't noticed, Xcell Online is a fast and efficient way to stay informed about Xilinx and its 
partners. And, it keeps getting better. As we develop the website, you will see many more articles 
about the topics that are important to you. And, you will find many valuable articles that we could 
not fit into the printed Xcell Journal (we receive far more content than we can print). You can see Xcell 


Online, and other Xilinx literature, at: www.xilinx.com/literature/. 
Virtex-II Pro and the New Programmable World 2002 


This issue of the Xcell Journal focuses not only on a new product, but also a revolutionary new 
development paradigm. Our new Virtex-I] Pro™ family of Platform FPGAs is truly unique and 
very powerful — it promises to change forever the way you approach system-level design. As this 


new technology quickly evolves, you will find the latest technical articles, online, as soon as we can 


publish them, on Xcell Online. 


You will also have the opportunity to hear the industry leaders discussing this new design paradigm, and 
receive valuable training, at the Xilinx Programmable World 2002 event in April. Attend Programmable 


World 2002 either online or in person at various locations. See www.xilinx.com/pw2002 for details. 


Thanks again for your interest in Xilinx and Xcell. Our primary goal is to help you succeed, so please 
continue to send me your comments and suggestions. 


a" - 

” nn =o ra =—- 
(tba? (eben 
Carlis Collins 


editor@xilinx.com 
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There is no better way to keep up to date on the latest 
programmable logic technologies trom Xilinx and its partners. 


Visit Xcell Online for the latest news 
and information, as soon as it arrives. 


You can view and download the latest Xcell articles, 
and all up-to-date Xilinx literature, 
~ by visiting: www.xilinx.com /literature /index.htm. 





Contents Spring 2002 


Interesting Times at Xilinx 

The New Era of Programmable Systems 
Virtex-ll Pro: The Platform Has Arrived 
Virtex-ll FPGAs — Product of the Year 

The PowerPC — A Programmer's View 
IBM's PowerPC Strategy Road Map 
Rocket 1/0 Multi-Gigabit Transceivers 
Bestof-Class Embedded Software Tools 
ISE 4.2i Expands Design Productivity 

New Contig. Options for Virtex-ll Pro 
Platform FPGAs Take on ASIC SOCs 

Your Recontiguration is in the E-mail 
SignOnce and Break the IP Barrier 

Bring on the Music — Take out the Noise 
Programmable Solutions for Set-Top Boxes 
Low-cost Digital Video Reference Design 
Programmable Logic Enables Digital Displays 
Digital Convergence Demands Reprogrammability 
The FlexBench Tool Suite 

Modular Engineering Solutions Plattorm 
Parallel Cable IV Connects Faster 

SCCII Microsequencer 

Upgrade to Synopsys FPGA Compiler II 
Rocket 1/0 Multi-Gigabit Design Kit 
CoolRunner-Il CPLD Development Kit 
Virtex-ll Pro FPGA Data Sheet Overview 


Xilinx Product Reference Guide 


XCEll joumal 








Viev 
from the top 


| | ti ] 
® | @ ! 
T X Witness the worldwide, world-class debut 
(] | | { x of the Virtex-II Pro Plattorm FPGA solution 
Tor programmable systems. 
| 4 


f : 












4 - 
i |e 
. \ : 
' L, ‘ 
j P | 
4 I 
| : i\ 
1 5 
4 hi 
; F] rho ¢ ‘ 
| ti 
a 
L 
. r ’ 
} 
1 
j | 
f 
——— * f i F : 
SS _—, ay Lich a ic 7 _ > — ah 
= al AY ' i | ‘ lw 
4 Xcell Journal i Se | wy Spring 2002 =) 








by Wim Roelandts 
CEO, Xilinx 


There is an 
ancient curse: 
May you live in 
interesting times. 


No doubt, we 


are living in 





interesting times. I choose, however, to view 
this as a blessing rather than a curse. True, 
competition is fierce and economic condi- 
tions are chaotic, but opportunities for 
growth and change in the programmable 


logic industry are limitless. 


Big drops in revenues and even the sudden 
disappearance of large customers have 
forced us to revisit our strategies and redou- 
ble our efforts. Preserving capital, maintain- 
ing time to market, coping with lower budg- 
ets, reducing unnecessary risks, and staying 
the course in the face of uncertain market 


conditions has tested the mettle of all of us. 


As we climb back up from the bottom of 
this recession, we are bringing with us a 
new paradigm for Xilinx®. With the 
introduction of the Virtex-I] Pro™ 
Platform FPGA, we have changed from a 
programmable logic supplier to a purvey- 


or of programmable systems. 


What If? 
Consider this: 


e What if your corporation had access to 
an off-the-shelf, system-level product 
that allowed your design teams the max- 
imum flexibility at system level without 


the traditional inventory risks? 


¢ What if this off-the-shelf product had all 
the latest functionality they were look- 
ing for — and were way ahead of stan- 


dard cell technologies? 


¢ What if they could chose to never again 
deal with 0.13 micron silicon design 


issues or budget for huge NRE expenses? 


The what-ifs for this dream can go on and 
on — but this is not a dream. This is what 
a Virtex-II Pro Platform FPGA solution 


can do for you right now. 
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Strategic Partnerships 


Xilinx, IBM®, and Conexant™ Systems 
have been quietly working together to 
respond to the issues and challenges facing 
design teams and their corporations. 
Virtex-II Pro Platform FPGAs feature as 
many as four IBM PowerPC™ 405 
processors immersed and embedded with- 
in the FPGA fabric. Moreover, the Virtex- 


II Pro devices connect to the outside 





world via as many as 16 Rocket I/O™ 
3.125 multi-gigabit serial transceivers 
capable of interfacing with multiple paral- 


lel and serial protocols and standards. 


The Virtex-IJ Pro solution delivers both 
high-performance processing and high- 
bandwidth connectivity all in one 

device. And that’s not all. 
Xilinx XCITE™ digitally 
controlled impedance 
technology removes hun- 
dreds 
resistors from the printed 
circuit board. Xilinx IP 


and Active 


of termination 


Immersion 
Interconnect technologies 
allow the PowerPC processors to 

bypass peripheral bus bottlenecks to 
connect directly with the FPGA logic and 


memory array. 


Our partnerships on the software side with 
Wind River Systems, The MathWorks, 
Cadence Systems, Mentor Graphics, 
Synopsys, Synplicity, and more have paid 





— 


off as well. The Virtex-II Pro solution 
comes with a complete set of Xilinx-specif- 
ic embedded software tools for develop- 


ment, simulation, and debugging. 


The close alliance with our partners and the 
tight integration of hardware and software 
in the Virtex-II Pro platform allows on- 
demand architectural synthesis with tremen- 
dous flexibility and scalability. You can effi- 
ciently divide complex functions between 
high-speed implementation in hardware and 


high-flexibility implementation in software. 
See for Yourself 


In the process of delivering the Virtex-I] 
Pro solution, Xilinx has had to change its 
infrastructure to go beyond being a pro- 
grammable logic supplier into becoming a 
programmable system provider. We knew 
that being a system-level provider didnt 
mean just innovation in silicon. It meant 
acquisitions and alliances in areas of I/O 
speed and connectivity, software develop- 
ment, design services — and preparing our 
entire workforce — from the experts in the 
field to the experts in customer support — 
to truly deliver a complete solution to our 


partners and customers. 


Talk is cheap. So, let us show you what the 
family of Virtex-II Pro Platform FPGAs 
can do for you. I personally invite you and 
your engineering design teams to 
Programmable World 2002 to be 

held Apri 7#in Sam@lose, 
Boston,» Paris — and more 
sites to be announced later. 
This worldwide exposi- 
tion and conference will 
offer a general session and 
feature 16 presentations in 


four technical tracks. 


Registration is mandatory, 

seating is limited, but participa- 
tion is free. We, and our world-class part- 
ners, want to train you for the next gener- 
ation of embedded, system-level program- 
mable devices. For more information, 
read this issue of Xcell Journal (including 
the back cover) and register online at 


www.xilinx.com/pw2002. 
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Programmable Systems 


The New Era ot 


Programmable Systems 


The next breakthrough in processing and system design methodology comes 
from the merger of the most advanced technologies trom Xilinx and IBM. 
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by Babak Hedayati 


Sr. Director of Product Solutions and Partnerships 
Xilinx, Inc. 
babak.hedayati@xilinx.com 


Over the course of the semiconductor 
revolution, with the help of Moore's Law, 
FPGAs have grown to densities of 10 
million gates. They have consumed key 
system-level functions such as block 
memory, clock management, digitally 
controlled impedance matching, embed- 
ded multipliers, 844 Mbps LVDS I/Os, 
and many other functions. As the densi- 
ties increased so did the insatiable hunger 
for soft IP cores for simple functions, 
complex DSP algorithms, networking 
protocols, interfaces, and so on. 


The advantage gained by increased FPGA 
capabilities — sometimes unbelievable to 
some — has been adopted by thousands of 
design teams looking to improve their 
time to market by targeting segments of 
their system to Virtex™-II FPGAs when- 
ever possible. Yet few could initially imag- 
ine the possibilities of a “programmable 
system” when Xilinx talked of immersing 
300 MHz IBM® PowerPC™ 405 proces- 
sors into the Virtex-II FPGA fabric and 
embedding high-speed multi-gigabit 


serial I/Os around it. 


Many immediately saw the value of 
integration for cost reduction, increased 
performance, and reliability. Others saw 
the potential for its incredible flexibility 
and scalablity for implementing special- 
ized and high-speed interfaces. The idea 
of extreme hardware parallel processing, 
and multiple processing on the same 
device, enticed many system designers 
to consider the great possibilities of such 
a solution. 


Many engineers expected the next break- 
through in processing and system design 
methodology to come solely from the 
masters of the microprocessor world and 
leading ASIC vendors — but the break- 
through has come from the merger of the 
most advanced technologies from Xilinx 


and IBM. 


The need for high-speed communica- 
tion and increased bandwidth has driv- 
en the rapid evolution of technology 
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throughout multiple industries. Design 
challenges associated with integration, 
high speed interfacing, higher perform- 
ance processing, and new design method- 
ologies must be solved, and the rapid rate 
of change in technology demands hard- 
ware programmability — this time at the 
system level. 


Digital Convergence 


The convergence of voice, video, data pro- 
cessing, and packet processing both on the 
infrastructure equipment and consumer 
products is putting immense pressure on 
corporations and their engineers. They 
must now incorporate computing, net- 
working, wireless, and video imaging tech- 
nologies that previously existed stand-alone 
in their respective markets. 


This digital convergence has been an 
inevitable reality since the early days of the 
electronics industry, where specialized 
equipment and devices demonstrated their 
true potential as soon as they were con- 
nected with other devices. First came the 
telegraph, then the telephone, computers, 
video, the Internet, storage, wireless, and 
the infrastructure behind it all. Now the 
world is incredibly crowded with new con- 
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sumer technologies that seem to pop up on 
a daily basis incorporating new features 
from the digital revolution. They target 
individual niche areas, compete, and often 
become extinct, just as suddenly as they 
were brought to market. This extinction is 
often caused by rapidly changing standards 
and requirements, or by competitive prod- 
ucts putting tremendous pressure on cor- 
porations and their engineering teams. 


Moving Toward Total 
Cost Management 


Rapid product extinction makes executives 
question why their companies are spending 
so much capital on multi-million dollar 
ASIC NREs (Non-Recurring Engineering 
charges) and design automation contracts — 
because today’s economic and technological 
conditions often require design changes 
midway in the development process. Success 
in todays marketplace is accomplished by 
getting to market first, not by designing for 
high volumes and the lowest unit cost — 
spending immense amounts in advance on 
creating custom ASICS, without a guaran- 
teed future, is a recipe for failure. 


Many companies are faced with huge inven- 


tories of ASICs and ASSPs that cannot be 
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Figure I - Leading edge price/performance through integration and reduction of board size 
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re-targeted to multiple products to reduce 
the risk. Though ASICs are often less expen- 
sive for high volume designs, they incur 
much higher overall costs, higher risk, 
longer development cycles, and less time in 
market. It’s time for corporations to evaluate 
the total cost of their development strategy 
as a whole and avoid falling into the pitfalls 
of separating capital investments, develop- 
ment costs, production costs, obsolescence 
costs, and inventory management. 


The coming generation of computers and 
telecommunications equipment is very dif- 
ferent from prior evolutions of information 
technology because it is dramatically revers- 
ing the age-old wisdom of creating specific 
devices for each application. Current sys- 
tems are integrating computers, cell phones, 
game systems, cameras, appliances, automo- 
biles, offices, and homes. Eventually, we will 
likely have only a few types of super systems 
remaining that synthesize and extend the 
capabilities of all current systems. 


One of the key trends to reduce cost, 
increase performance, and increase the reli- 
ability of systems has been through integra- 
tion. However when designers integrated 
their systems into custom ASICs, they 
increase inventory risks and require large 
initial up front investments. Hence, com- 
panies find it difficult 
to stop midway to 
change their designs. 
In addition, smaller 
companies or start ups 
find that they must 
commit the majority of 
their funding just to 
develop their platform 


— and sometimes they : 
have a difficult time e 
Scuriiiemamiatec 1S lC £ 
supplier to entertain i 
their development. 5 
A fully — integrated i 
system-level solution 


such as the Virtex-II 
Pro™ family solves all 
of these 
Offering multiple giga- 
bit serial I/Os, the 
fastest FPGA solution 
in the world, up to four 


problems. 


— 
| 
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Power PC 405s, XCITE™ controlled 
impedance technology, and other system- 
level features, you get a smaller board size, 
lower overall costs, and faster time to 
market, as illustrated in Figure 1. 


The Ultimate Connectivity Solution 


For a long time, the original PCI bus was the 
industry standard. To increase the band- 
width, many designers began to use bridges, 
continuing with the parallel, shared bus 
strategy. Then the standard moved to 64-bit 
66 MHz versions, and later to PCI-X run- 
ning at 133 MHz. 


The problems with continuing this strategy 
are obvious. Wider busses require more pins 
and higher cost, and moving to higher fre- 
quencies causes signal integrity issues. Plus, 
the shared bus created more overhead and 
less bandwidth predictability. Although PCI 
will be used for years to come, today’s per- 
formance-hungry applications have already 
moved toward packet switched LVDS-based 
parallel methodologies such as POS PHY 
Level 4, Flexbus 4, RapidlO, Hyper 
Transport, and others. This requires smarter 
protocols and point-to-point interfacing 
between devices and boards. The move was 
welcomed because it increased bandwidth. 
However, it often resulted in increased clock 
skew and signal integrity issues. 
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Figure 2 - The trend towards switched and distributed serial I/O 
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In the last two years companies like Xilinx, 
in partnership with Conexant (and the later 
acquisitions of companies such as 
RocketChips), have discovered how to 
implement mutli-gigabit serial I/Os in 
CMOS technology, making it a cost effec- 
tive method of delivering point-to-point 
serial switched interconnections. [his means 
much higher performance without any side 
effects. Other companies in the silicon 
industry, such as ASSP companies and stan- 
dards committees, are now quickly adopting 
this strategy to reduce cost, increase reliabil- 
ity, and increase bandwidth, as shown in 


Figure 2 and Figure 3. 
The Processing Revolution 


The quest for higher performance process- 
ing is evident in many applications. 
Companies have traditionally turned to 
“farms” of expensive high-performance 
processors to achieve the performance they 
need. In doing so, they have usually faced 
prohibitive costs along with the massive 
efforts of managing and partitioning their 
tasks throughout the processor farm. 


Hardware oriented companies such as net- 
working, telecom, and wireless infrastructure 
developers have taken the lead by imple- 
menting parallel processing in hardware. For 
example, using Xilinx XtremeDSP™ solu- 
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tions, with a single clock cycle they can 
process massive amounts of multiply accu- 
mulate functions (over 600 billion MACs 
per second). On the other hand, when com- 
panies have had to deal with multiple small- 
er tasks, they have 
traditionally turned 
to multiple proces- 
sors, and optimized 
the code for process- 
ing each smaller 
task; a technique 
often used in net- 
work processors, as 
shown in Figure 4. 


Now for the first 
time, through the 
Virtex-II Pro pro- 
grammable system, 
designers get both 
high-performance processing and distributed 
processing through multiple PowerPC 
processors immersed in the FPGA fabric. 
With Virtex-II Pro FPGAs, the whole is 


much more than the sum of the parts. 


Enabling a New System 
Development Paradigm 


Design teams can now make system-level 
tradeoffs and optimization throughout the 
design cycle. Traditionally, architecture- 
level teams have had to make such tradeoffs 
early in system definition phase — deciding 
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Figure 3 - Virtex-II Pro FPGAs support all connectivity standards 
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what goes into hardware what gets imple- 
mented as software code. This restriction 
has been the cause of many delayed prod- 
ucts and products that have been unsuccess- 
ful, because of the inability to make adjust- 
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Figure 4 - Virtex-II Pro FPGAs provide higher performance processing 


ments for new features or performance 
optimizations during the design phase. 
Electronic design automation (EDA) com- 
panies have partially addressed the problem 
by developing system-level tools (such as 
behavioral partitioning tools and so on) to 
make the system-level tradeoffs easier. 


Now, with Xilinx technology, design teams 
can make tradeoffs and optimizations 
throughout the system design; creating a 
more integrated system with higher per- 
formance and faster time to market. They 
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can even make changes 
to their hardware and 
software in the field, 
after the product is in 
the customers’ hands, to 
fix bugs or implement 
new features. New busi- 
ness models can now be 
developed for program- 


mable system design. 


The Virtex-II Pro solu- 
tion provides a standard 
programmable system 
platform, fully support- 
ed by embedded devel- 
opment tool vendors 
such as Wind River 
Systems; EDA compa- 
—, nies such as Cadence, 
Mentor Graphics, and 
Synopsys; and system- 
level tools companies 
such as Celoxica and The Mathworks — all 
industry leaders and strategic partners to 
Xilinx. These partnerships enable a new 
development paradigm for programmable 
systems design. This overall solution of 
devices, software, cores, and partnerships 
means that companies like yours can now 
rest a little easier. 


Welcome to the Programmable World 


Xilinx and its partners are taking the next 
step in the evolution of programmable logic 
by creating a new event — Programmable 
World 2002. Here, you will hear industry 
leaders and visionaries discussing the latest 


Platform FPGA 
detailed technical training for the PowerPC, 


solutions, including 
Wind River tools, and others. From imple- 
mentation techniques for multi-gigabit seri- 
al I/Os to digital signal processing, you will 
hear experts from more than 50 companies 
discussing this revolution in logic design. 


Programmable World 2002 will be held 


simultaneously in multiple locations 
throughout North America and Europe. 


You wont need to travel, but if you do, it’s 
all free with breakfast and lunch included. 


April 17th you can see it all. To get the 
full details, or to register for the technical 
sessions, go to: www.xilinx.com/pw2002. 
Be sure to reserve your seat now; atten- 
dance is limited. 
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Virtex-ll Pro FPGAs: 


The Platform for Programmable 


Systems Has Arrived 


by Anil Telikepalli 

Marketing Manager, Worldwide Marketing 
Xilinx, Inc. 

anil. telikepalli@xilinx.com 


The curtains have been raised! The Virtex-II 
Pro™ Platform FPGA solution — the most 
sophisticated silicon and software product 
ever — is now available for programmable 
system design. Programmable systems repre- 
sent flexible and scalable systems that are 


programmable at the architectural level. The 


goal in developing the Virtex-II Pro FPGA 


was to revolutionize system architecture by 
tightly integrating hardware and software 
functions on a single platform with unprece- 


dented flexibility and scalability. 









To achieve that objective, circuit engineers 
IBM, 
Mindspeed, and Xilinx worked together to 
develop this advanced Platform FPGA. At 


the same time, engineering teams from top 


and system architects from 


embedded systems companies, including 
Wind River Systems and Celoxica, worked 
alongside Xilinx software teams to develop 
the systems software and IP solutions that 


bring a new methodology to system design. 


The result is the first Platform FPGA solu- 
tion capable of implementing ultra-high 
bandwidth SOC (system-on-a-chip) designs 
that were previously the exclusive domain of 
custom ASICs. The Virtex-II Pro presents all 
the advantages of ASICs — and still retains all 
the flexibility and low develop- 

ment cost of program- 
mable logic devices. 
The Virtex-I] Pro 
solution enables 
high performance 
programmable sys- 
tems specifically in 
the areas of wired and 
wireless networking, 
storage systems, profes- 
sional broadcast, embedded 


systems, and digital signal 


The Virtex-ll Pro solution heralds a paradigm shift in 
system architecture by moving design based on zones ot 
programmability to entire system-level programmability. 


processing systems. The new Virtex-II Pro 
FPGAs come in five densities, seven pack- 


ages, and 15 combinations. 
Virtex-Il Pro FPGA Revealed 


As a platform for programmable systems, 
the Virtex-I] Pro FPGA is both flexible and 
scalable throughout all aspects of system 
architecture. By embedding processor cores 
within the FPGA fabric, the Virtex-II Pro 
architecture provides tight coupling between 
high-performance processors and the high- 


yy olerere mo) coyca e-Teovent-10) (ome (o)z4 (enum Resexolu oll emule 


two components enable the’ most optimal 
yet flexible partitioning of hardware and 
software in a programmable system. The 
Virtex-II Pro FPGA is built upon the lead- 
ing Virtex-II™ FPGA architecture with 
Rocket I/O™. multi-gigabit transceivers 
and embedded IBM PowerPC™ proces- 
sors completely immersed into the FPGA 
fabric (Figure 1). 
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Figure I - The Virtex-II Pro XC2VP50 device 
features four IBM PowerPC 405 processors 
and 16 Rocket I/O multi-gigabit transceivers 

embedded in the FPGA fabric. 


Additionally, Virtex-II Pro Platform 
FPGAs offer the following features: 


¢ Five family members with 3,168 to 
50,832 logic cells, and 216 Kb to 3,888 
Kb of block RAM 


¢0.13u , 9-layer copper, low-k technology 


Process 


¢ 3.125 Gbps Rocket I/O multi-gigabit 
serial transceivers based on Mindspeed 
SkyRail™ technology, up to 16 per 


device 


¢ 300+ MHz PowerPC embedded proces- 
sor cores based on IBM’s PowerPC 405 


processor, up to four per device 


¢ Virtex-II IP-Immersion technology pow- 


ered by system-level features: 


- Flexible SelectI/O™-Ultra technology 
supporting 840 Mbps LVDS I/Os 


- Xilinx Controlled Impedance 
Technology (XCITE) capability, 
providing built-in digital impedance 
matching on all single-ended I/Os 


- Embedded 18 Kb dual-port block 
RAM resources 


- Embedded 18-bit x 18-bit multiplier 
blocks 


- DCM (digital clock manager) macros 
support de-skew and frequency/phase 


manipulation 


- Bitstream encryption (Triple-DES) for 


design protection. 
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Rocket I/O Transceivers 


Rocket I/O multi-gigabit transceivers 
(MGTs) are based on Mindspeed 
SkyRail™ CMOS technology. Each full- 
duplex transceiver runs from 622 Mbps to 
3.125 Gbps baud rate and includes the 
entire transceiver support circuitry (Figure 
2). The Rocket I/O blocks are the first 
transceivers embedded in FPGAs to reach a 
baud rate of 3.125 Gbps. Up to 16 MGTs 
bonded 


together to provide 


can be 


an ageregate data = 
rate of 40 Gbps for 
each of transmitter 


and receiver. 32/16/8 bits 


Well-designed seri- 
al transceivers have 


two fundamental 


iver Module 
| Physical Coding Sublayer ] Physical Media Attachment | 
(PCS) (PMA) 
u S 8B/10B i ! | TX+ 
TXDATA R » Serializer e 
_ Oa Slnteveye (<6 / 
C Oo | 
TX Clock Generator 
REFCLK l 


Channel Bonding 
and 


toward designing compact optical net- 
working equipment (impossible if you 


must put in several heat sinks). 


For example, four Rocket I/O blocks 
allow 16 printed circuit board (PCB) 
traces to support full-duplex 10 Gbps 
data rates. This is equivalent to 256 traces 
of typical busses or 68 traces of a high- 
speed parallel bus. Thus, the four Rocket 
I/O blocks allow a 16X reduction of PCB 


Transce! 





fal 


Transmitter 


Receiver 
20X Multiplier 


Clock Correction 


40 — 156.25 MHz 
requirements that 
distinguish them 
from others: 
32/16/8 bits 


¢ Ability to oper- 
ate at multi-giga- 
bit rates to sup- 
port emerging 


standards 


¢ Ability to bundle 
multiple chan- 


nels together for scalable data rate. 


Each Rocket I/O transceiver consists of 
both a digital Physical Coding Sublayer as 
Media 
Attachment to provide a fully integrated 


well as an analog Physical 
serializer/deserializer function that enables 
the entire functionality and performance of 
emerging serial standards (Table 1). 


Historically, serial transceivers have been 
analog components built using SiGe or 
GaAs processes. These transceivers gener- 
ate enormous quantities of heat — and any 
integration of channels to increase the 
data rate was out of the question. 
However, Xilinx Rocket I/O transceivers 
not only provide multi-gigabit data rates, 
they can also be tied together to increase 
the aggregate bandwidth by using the 
built-in channel-bonding capability. This 
scalability is especially important as data 


rates increase and the industry moves 


¢ RXDATE = Elastic 8B/10B (4 Deserializer i 
Buffer BY-Yerefe(-M\ LOO EN De a = 
— small 


Figure 2 - Inside the Rocket I/O block 








RX Clock Recovery 


traces over conventional parallel busses, 
resulting in significant reduction of PCB 
complexity and EMI system noise. In 
short, Rocket I/O technology allows 
higher bandwidth systems than currently 
possible, with cost savings from faster 


time-to-market, reduced power consump- 


Line Speed 


Technology 


3610 2.5 Gbps 
Serial ATA 1.5 Gbps 
IntiniBand 2.5 Gbps 

Gb Ethernet 1.25 Ghps 
10 GE (XAUI) 3.125 Gbps 
Serial RapidlO 1.25 Gbps 
FibreChannel 1.06 Gbps 


Table 1 - Virtex-II Pro Platform FPGAs support 
these protocols and baud rates. 
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tion, smaller PCB size, and 
lower component count 


(Figure 3). 
Ultimate Connectivity 


Virtex-II Pro 
FPGAs, equipped with 
Rocket I/O MGTs, support 
emerging serial connectivity 
standards — and with Xilinx 
SelectI/O™-Ultra technolo- 
gy, these next-generation 
Platform FPGAs also support 
today’s parallel connectivity 
standards (Table 2). Thus, the 
Virtex-II] Pro FPGA serves as 
an ultimate connectivity plat- 
form to bridge across various 
interface standards in chip-to- 
chip, board-to-board, or even 


WAN/MAN/LAN networks. 


In addition to these hardware physical 
interface capabilities, the Virtex-II Pro 
solution provides PowerPC processors and 
soft intellectual property (IP) cores to make 
designing with any protocol easy. 


IBM PowerPC Processors 


Each IBM PowerPC processor runs at 
300+ MHz and 420 Dhrystone MIPS. 
Even though the PowerPC 405 core occu- 
pies a small portion of the die area, it pro- 
vides tremendous system flexibility. Instead 
of attaching the PowerPC 405 processor 
next to the FPGA with a bus interface (as 
certain vendors have attempted), the 
Virtex-IT Pro engineering team embedded 
the processor entirely within the FPGA 
fabric. Using Xilinx IP Immersion and 
Active Interconnect technologies, hundreds 
of processor nodes are directly connected 


to the FPGA logic and memory array. 


Such total immersion gives you the 
utmost flexibility in hardware/software 
system architecture. You can efficiently 
divide complex functions between high- 
speed implementation in hardware and 
high-flexibility implementation in soft- 
ware. This direct-connect configuration 
bypasses the bottleneck of using a bus to 
interface between the FPGA and an 


attached/external processor. 
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Figure 3 - The Virtex-II Pro advantage: reduced PCB board 


space with less complexity and fewer components 


The PowerPC 405 core has unique on-chip 
memory (OCM) controllers that bypass 
the processor bus for fast, direct access to a 
fixed amount of instruction and data mem- 
ory implemented in Xilinx SelectRAM™ 
modules (Figure 4). This is especially useful 


for data streaming applications. 


The PowerPC processor is supported by 
IBM CoreConnect™ technology — a high- 
bandwidth 64-bit bus architecture that 
runs at 100 to 133 MHz. For maximum 
flexibility, the CoreConnect architecture is 
implemented as a soft IP within the Virtex- 
II Pro FPGA fabric (Figure 5). You can add 
CoreConnect peripherals from an extensive 


IP library from Xilinx and third-party part- 


LAN/MAN/WAN 


® 10/100 Ethernet 
© 1Gb Ethernet 

© 10Gb Ethernet 

@ | Gb Ethernet PHY 
@ 10GE XAUI 

@ SONET Standards* 


© PCI 32/33 
© PCI 64/66 
© PCX 100 


@ Fibre Channel 
@ J0GE XAUI 








Board-to-Board 


© Rapidld 
@ (SIX 


© Hyperlransport 
@ Serial RapidlO © InfiniBand 


@ 3610 





ners or develop proprietary 


peripherals of your own. 


The CoreConnect bus architec- 
ture has two main buses, called 
the Processor Local Bus (PLB) 
and the On-chip Peripheral Bus 
(OPB). These buses can be used 
for interfacing high-speed and 
low-speed peripherals with the 
PowerPC processor respectively. 
Additionally, a third Device 
Control Register (DCR) bus is 
used for transfers to and from 
peripheral 


general purpose 


device registers. 


! The Virtex-II Pro Platform 
FPGA is also supported by a 
complete set of embedded soft- 
ware tools for development and 

debug. Through an OEM agreement, 

Xilinx is able to provide software tools from 

Wind River Systems that are customized 

for Virtex-II Pro FPGAs. These include: 

¢ Diab™ XE (Xilinx Edition) compiler 

¢ SingleStep™ XE software debugger 

¢ visionPROBE II XE JTAG run control 


hardware connection probe. 


In addition, a suite of GNU (open-code 


Linux) tools is also available. 


With the Virtex-II Pro solution, you can 
use the FPGA fabric for highly parallel pro- 
cessing and fixed algorithms. You can use 


the PowerPC processor for sequential com- 


Chip-to-Chip 
© PC32/33 —@ POSPHY 13/14 
@ PC 64/66 ® Flexbus 4 
@ PC-X66 & 100 ©@ CSIX 
® Rapidld 
@ | 0GE XAUI 


© Hyperlransport 
@ 3610 


©@ Serial standards enabled by Rocket 1/0 technology — Virtex-II Pro FPGAs 


Table 2 - Virtex-II and 
Virtex-II Pro FPGAs offer 


multiprotocol connectivity. 


© Parallel standards enabled by Select!/O-Ultra technology — Virtex-ll & Virtey-ll Pro FPGAs 
* SONET compatible, supports data rate only 
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Figure 4 - Inside the PowerPC 405 processor 
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Figure 5 - Hard and soft IP cores 


puting, exception handling, and control 
functions. The Rocket I/O serial trans- 
ceivers and SelectI/O-Ultra parallel tech- 
nologies enable optimal data access into 


and out of the Virtex-II Pro FPGA. 


On-Demand Architectural Synthesis 


Architectural synthesis is a combination of 
tools and technologies that allows designers 
to specify high-level requirements for design- 
ing their systems. In other words, architec- 
tural synthesis is a tool-based partitioning of 


hardware and software. 


In order for architectural synthesis to work, 
both the hardware and software components 
of the system must be tightly integrated. 
Virtex-II Pro Platform FPGAs enable on- 
demand architectural synthesis with tremen- 
dously flexible, scalable, and high-bandwidth 
features. You can perform architectural syn- 
thesis anytime in the product cycle — during 
system design and debug phases, or even 
after the product has shipped. With abun- 
dant resources of hardware and _ software, 
Virtex-II Pro FPGAs give you the flexibility 


and scalability for fine-tuned system archi- 
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Debug 


Instruction 





Low-Speed 
Peripheral 


tecture, partitioned optimal- 
ly between hardware and 


software. 


Price & Performance Leader 


Timers 


a Leading-edge systems need 


high bandwidth serial I/O, 
which has only been achiev- 


able by 


FPGA to an external serial 


Logic 


interfacing an 
Trace 
transceiver by means of 
hundreds of pins. Similarly, 
high-performance systems 
typically require one or 
more processors on the 
board, creating even more 


connectivity problems and 







PCB complexity. 





Me) 
Interfaces . : 2 
By immersing multi-giga- 


bit transceiver blocks and 


processor cores within 
the FPGA fabric (Figure 
6), the Virtex-II Pro 


Platform FPGA delivers 
the best price and perform- 
ance. Integrating proces- 
sors and transceivers within 
the FPGA fabric lowers costs and raises 


performance by: 


¢ Saving PCB space 
e Simplifying PCB complexity 
¢ Requiring fewer components 


¢ Eliminating complex device interconnec- 


tivity issues 


¢ Reducing overall system power con- 


sumption 


¢ Using XCITE digitally controlled imped- 
ance technology to do away with external 


termination resistors 


¢ Enabling optimal system partitioning 


between hardware and software. 


A New Development Paradigm 


By tightly integrating flexible and scalable 
high-performance programmable logic 
with the PowerPC processor, the Virtex-II 
Pro Platform FPGA _ fundamentally 
changes the way systems are designed. The 
Virtex-II Pro solution facilitates a new par- 


adigm in system development with signifi- 


| Metal 4 
| Metal 3 
| Metal 2 _ 
| Metal 1 


| Metal 4 
| Metal 3 
| Metal 2 _ 
| Metal 1 
Poly 


ade)" (=) oa Os 
Immersed 
into FPGA 


Poly 


Silicon Substrate 





Figure 6 - Hard IP cores are deeply 
embedded and actively connected within 
the FPGA fabric 


cant benefits, specifically in software engi- 


neering productivity: 


¢ Embedded processors can be used for 
rapid system pre-production to facilitate 
accelerated software development. A pre- 
liminary hardware platform can be built 
quickly by emulating a C-based algorithm 
using the embedded processors. This cre- 
ation of a preliminary hardware platform 
allows software development to start 
much earlier in the design process, com- 


pared to current practice. 


Software debugging can be performed at 
hardware speeds while the hardware imple- 
mentation continues to be speed-opti- 
mized. Xilinx ChipScope™ Pro on-chip 
verification tools provide in-system observ- 


ability into both the FPGA hardware and 


the processor bus transactions. 
Conclusion 


The Virtex-II Pro Platform FPGA solution 


encompasses the following: 


¢ Rocket I/O 


PowerPC processors 


and IBM 


and 


transceivers 
immersed 
embedded into the high performance 
Virtex-II Pro FPGA fabric 


¢ Intellectual property solutions, including 
soft peripherals and connectivity cores 


¢Complete design resources, including 


development tools and kits 


To find out more about this revolutionary 
next-generation Platform FPGA for pro- 
grammable systems, go to www.xilinx.com/ 


virtex2pro. 
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by Xilinx Staff 
For 26 years, Electronic Products has held 


an annual contest to choose the most out- 
standing products introduced each year. The 
editorial board at the magazine considers 
thousands of product introductions based 
on significant advances in technology or its 
application, a decided innovation in design, 
or a substantial gain in price-performance 
benefits. As usual, picking winners was 
made difficult by the many impressive prod- 


ucts announced during the year. 


The Xilinx Virtex™-II FPGA family was 
recognized as the FPGA platform to address 
next-generation designs. “Programmable 
logic has heretofore been limited to relative- 
ly simple computational tasks and glue 
logic functions,” said David Suchman, 
Digital IC editor at Electronic Products. 
“Now, for the first time, a programmable 
platform is available from Xilinx to enable 
rapid development of today’s technically 
challenging applications.” 


The ever-increasing requirements for 
higher performance and system-level fea- 


tures are bringing new challenges as 
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designers develop the next-generation of 
complex high-performance digital appli- 
cations such as data communications and 
DSP systems. Characterized by high 
logic integration, fast and complex rout- 
ing of wide buses, and extensive require- 
ments for pipeline and FIFO memory, 
these new systems exceed the capabilities 
of current programmable logic devices, 
which lack the gate capacity, memory, 
routing resources, performance, and 
architecture flexibility that is required to 
designs. The 
Electronic Products’ editors judged that 


fully support these 
the Xilinx Virtex-II series (see Electronic 
Products, May 2001) also solves the 
problems resulting from signal integrity, 
system timing, EMI, and security issues 


in these complex systems. 


Be Vitex FPGAs - 
wy Product 01 
Year 


Xilinx recently received the Product of the Year Award 
from Electronic Products magazine — and we were 
the only programmable logic supplier to receive this award. 


The Virtex-II family allows unlimited design 
changes throughout the development and 
production phases for optical networks, giga- 
bit routers, wireless cellular base stations, 
modem arrays, and video broadcast systems. 
Capable of handling designs from 40,000 to 
10 million system gates, Virtex Series FPGAs 
feature an interconnect architecture for opti- 
mizing routing, and an advanced memory 
array with up to 4.5 Mbits of on-chip mem- 
ory. An additional feature is the industry's 
first digitally controlled impedance technolo- 
gy, which maintains constant impedance 
even with temperature and voltage fluctua- 
tions — eliminating hundreds of termination 
resistors, saving board space, increasing relia- 


bility, and lowering costs. 


For more information on the winners, go to 


www.electronicproducts.com. 


“PROGRAMMABLE LOGIC HAS HERETOFORE BEEN LIMITED 10 RELATIVELY SIMPLE 
COMPUTATIONAL TASKS AND GLUE LOGIC FUNCTIONS. NOW, FOR THE FIRST TIME, A PROGRAMMABLE 
PLATFORM 1S AVAILABLE FROM XILINX TO ENABLE RAPID DEVELOPMENT OF TODAY'S TECHNICALLY 
CHALLENGING APPLICATIONS. — avo Sucawan, Otc 1 corn ar ELecraonic PRooucts. 
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by Anthony Marsala 
IBM 


The PowerPC™ Architecture is a Reduced 
Instruction Set Computer (RISC) architec- 
ture, with over two hundred defined instruc- 
tions. PowerPC is RISC in that most instruc- 
tions execute in a single-cycle and typically 
perform a single operation (such as loading 
storage to a register, or storing a register to 
memory). This article will focus solely on 32- 
bit implementations, which are the most 
widely available today. 


The PowerPC architecture employs a layered 
approach, in that it is broken up into three 
levels or “books”. By segmenting the architec- 
ture in this way, code compatibility can be 
maintained across implementations while 
leaving room for implementations to choose 
levels of complexity for price/performances 
tradeoffs. The three levels are broken up from 
the most general and common across imple- 
mentations to the most operating system spe- 
cific. The levels are: 


¢ Book 1. User Instruction Set Architecture — 
This level defines the base set of instructions 
and registers that should be common to all 
PowerPC implementations. 


¢ Book 2. Virtual Environment Architecture — 
This level defines additional user-level func- 
tionality that is outside the normal applica- 
tion software requirements. Areas include 
cache management, atomic operations, and 
user-level timer support. 


° Book 3. Operating Environment Architecture 
— This level defines privileged operations typi- 
cally required by an operating system. Areas 
include memory management, exception vector 
processing, privileged register access, and privi- 
leged timer access. 


Editors note: This article is reprinted with 
permission from IBM. It was originally a 
two-part series that ran in the April, 2001, 
IBM PowerPC Processor News. You can view 
the original articles, and find other useful 
information at: www-3.ibm.com/chips/products/ 
powerpc/newsletter/apr200 I/design-h-t.html. 
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Deviations from the original PowerPC 
Architecture offer flexibility to allow for 
enhancements that may come over time. In 
addition, IBM has defined its own Virtual 
Environment and Operating Environment 
levels for its PowerPC 400 family of 


embedded controllers. 
Book E — A New Definition 


A new PowerPC architecture update has 
been developed. Called “Book E”, it com- 
bines the original three architecture levels 
into one new specification. This new spec- 
ification also streamlines the definition of 
64-bit implementations and eliminates 
non-substantive differences between IBM 
and Motorola implementations. The new 
standard maintains 100% code compatibil- 
ity with Book 1 instructions and registers, 
while formally defining software-based 
memory management, a two-level inter- 
rupt hierarchy, and user-extendible instruc- 
tion space for auxiliary processors. All of 
these enhancements address the needs of 
embedded systems. 


To distinguish between the original architec- 
ture, the IBM embedded definitions, and 
Book E, the original architecture will be 
referred to as the “classic” architecture for the 


remainder of this article. 
Storage Model 


The 32-bit PowerPC architecture has native 
support for byte, halfword (16-bits), and 
word (32-bit) data types. Also, PowerPC 
implementations can handle string opera- 
tions for multi-byte strings up to 128 bytes 
in length. The 32-bit PowerPC implementa- 
tions support a 4 GB address space (237). All 
storage is byte addressable. For misaligned 
data accesses, alignment support varies by 
product family, with some taking exceptions 
and others handling the access through mul- 


tiple operations in hardware. 
Endianness 


Classic PowerPC and the IBM PowerPC 
400 family are primarily big-endian 
machines, meaning that for halfword and 
word accesses, the most-significant byte 
(MSB) is at the lowest address. Support for 
little endian varies by implementation. 


Classic PowerPC had minimal support, 
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while the 400 family provides more robust 


support for little endian storage. 


Book E is endian-neutral, as the Book E 
architecture fully supports both accessing 
method. 


Registers 


Classic PowerPC registers are broken into 
two classes: special-purpose registers (SPRs) 
and general-purpose registers (GPRs). IBM’s 
PowerPC 400 family and Book E also define 
a third class of registers, called device control 
registers (DCRs), to address peripheral regis- 
ters outside of the processor core in an 
embedded controller implementation. The 


three classes are explained below. 
SPRs 


SPRs give status and control of resources 
within the processor core. Table 1 shows dif- 
ferent types of SPRs and their purpose. 
Where a single register exists, the SPR name 


is listed in parenthesis. 
Supervisor vs User-Mode SPRs 


When the processor is first initialized, it is 
in supervisor (also called privileged) mode. 
In this mode, all processor resources, 
including registers and instructions, are 


accessible. The processor can limit access to 


SPR Register Type 
Count (CTR) User 
Link (LR) User 
Save /Restore Supervisor 
Debug Supervisor 


Timers User (read) 


Supervisor (write 


Interrupt Vector Prefix Supervisor 


Exception Supervisor 
Storage Attribute Control Supervisor 
Processor Version (PVR) Supervisor 
General Purpose (SPRGn) Supervisor 
Integer Exception (XER) User 


MMU Supervisor 


Table 1: SPR registers 


Access Mode 


certain privileged registers and instructions 
by placing itself in user (also called prob- 
lem-state) mode. This protection limits 
application code from being able to modify 
global and sensitive resources, such as the 
caches, memory management system, and 
timers. Mode switching is controlled via the 
Machine State Register. 


¢ The Instruction Address Register (IAR) is 
known to programmers as the program 
counter or instruction pointer. It is the 
address of the current instruction. This is 
really a pseudo-register, as it is not directly 
available to the user. The IAR is primarily 
used by debuggers to show the next 
instruction to be executed. 


¢ The Processor Version Register (PVR) is 
useful for code common across multiple 
processors that must make decisions based 


on a specific processor. 
User-Mode SPRs 


There are four SPRs available in user-mode 


that are important to understand: 


¢ The Link Register (LR) is a 32-bit register 
that contains the address to return to at the 
end of a function call. Certain branch 
instructions can automatically load the LR 


to the instruction following the branch. 


Purpose 


Timing Facilities 


Locates Interrupt Addresses 
State information where exceptions occur 


Controls Storage Attributes (W,I,G,LE) 


Identities PowerPC Implementation 


Used by Operating Systems 
Carry Bit, Overtlow, String Lengths 


Instruction /Data Translation Control 
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The blr instruction moves the program 
counter to the address in the LR. 


¢The Fixed Point Exception Register 
(XER) contains overflow information 
from fixed point arithmetic operations. It 
also contains carry input to arithmetic 
operations and the number of bytes to 
transfer during load and store string 


instructions Iswx and stswx. 


¢ The Count Register (CTR) contains a loop 
counter that is decremented on certain 
branch operations. Also, the conditional 


branch instruction bectrx branches to the 


value in the CTR. 


¢ The Condition Register (CR) is grouped 
into eight fields, where each field is 4 bits 
that signify the result of an instruction’s 
operation: Equal (EQ), Greater Than 
(GT), Less Than (LT), and Summary 
Overflow (SO). 


Machine State Register (MSR) 


MSRs represent the state of the machine. It 
is accessed only in supervisor mode, and 
contains the settings for things such as 
memory translation, cache settings, inter- 
rupt enables, user/privileged state, and 
floating point availability. Exact control 


bits vary by implementation. 


The MSR does not readily fit into the 
SPR/DCR/GPR classification, as it con- 
tains its own pair of instructions (mfmsr / 


mtmsr) to read and write the contents of 


the MSR into a GPR. 
DCRs 


DCRs are similar to SPRs in that they give 
status and control information, but DCRs 
are for resources outside the processor core. 
DCRs allow for memory-mapped I/O con- 
trol without using up portions of the 32-bit 


memory address space. 
GPRs 


The User Instruction Set Architecture 
(Level 1) specifies that all implementations 
have 32 GPRs (GPRO - GPR31). GPRs are 
the source and destination of all fixed-point 
operations and load/store operations. They 
also provide access to SPRs and DCRs. 


They are all available for use in every 


18 Xcell Journal 


instruction with one exception: In certain 
instructions, GPRO simply means “0” and 


no lookup is done for GPRO’s contents. 
Instructions 


Table 2 lists different instruction cate- 
gories, and the types of instructions that 


exist in that category. 


Deciphering an Instruction 


For 32-bit implementations, all instruc- 
tions are 32 bits (4 bytes) in length. Bit 
numberings for PowerPC are opposite of 
most other definitions; bit 0 is the most 
significant bit, and bit 31 is the least sig- 


nificant bit. Instructions are first decoded 


Instruction Category 


Data Movement 
Arithmetic 
Logical 
Comparison 
Branch 
Condition 
Rotate /Shitt 
Cache Control 


Interrupt Control 


Processor Management 


MMU Control 
MAC Unit 


Table 2 - Instruction categories 


ri |) a) 
food 
od 
jo 
oris 


0 


= 


or X 
OF. XOr 
ori xorl 


xoris 


Exclusive OR Rotate and Mask Shift Misc. 


Base Instructions 

load, store 

add, subtract, negate, multiply, divide 

and, or, xor, nand, nor, xnor, sign extension, count leading zeros, andc, orc 
compare algebraic, compare logical, compare immediate 

branch, branch conditional, branch to LR, branch to CTR 

rand, crnor, crxnor, crxor, crandc, crorc, crnand, cror, cr move 

rotate, rotate and mask, shitt left, shift right 

invalidate, touch, zero, flush, store, dcread, icread 


write fo external interrupt enable bit, move to/from machine state register, 
return trom interrupt, return from critical interrupt 


system call, synchronize, eieio, move to/trom device control registers, 
move to/from special purpose registers, mtcrf, mtcr, mtmsr, mtmsr 


TLB search, TLB read, TLB write, TLB invalidate all, TLB synchronize 
multiply low/high haltword and accumulate /subtract 





rlwimi 5 


cntizw 
cntlzw 
sraw 


sraw. | extsb 
srawi | extsb. 


rlwimi. 5 


lwinm 


or immediate values. 


lwinm. 


lwam 


by the upper 6 bits, in a 
field called the primary 
opcode. The remaining 
26 bits contain operands 
and/or reserved fields. 


Operands can be registers 


Arithmetic Instructions 


lwnm srawi. 


nor egy 
Nor. eQV. 


lw 
lw. 
stW 
W. 


S 


= 
(an) 


Table 3 - Power PC logical instructions 





Many instructions exist 
for performing arith- 
metic operations, includ- 
ing add, subtract, nega- 
tion, compare, multiply 
and divide. Many forms 
exist for immediate val- 
ues, overflow detection, 
andmcating b..And —OUt. 


Multiply and _ divide 
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instruction performance varies among 
implementations, as these are typically 


multi-cycle instructions. 


Logical Instructions 


Table 3 lists PowerPC logical instructions. 
Looking at the AND instruction, The “i” 
form means that a 16-bit immediate is used 
for the AND, the “is” form means that a 
16-bit immediate is used in the upper 16- 
bits of the AND. For all “.” forms, the 
CR[CRO] is previously 
described. PowerPC has the ability to per- 


form a 32-bit rotate-and- 


updated as 


combine with a mask in a 
single cycle. In the miscella- 
neous column are instruc- 
tions to count the leading 
zeros in a register, and sign 


extension instructions. 
Load/Store Instructions 


All loads/stores are per- 
formed using the GPRs. 
Instructions exist for byte, 


halfword, and word sizes. 


Special instructions include: 


¢ Multiple-word load/stores 
(Imw / stmw), which can 
operate on up to 31, 32- 


bit words 


e String instructions, which 
can operate on up to 128- 


byte strings 


¢ Memory Synchronization 
instructions lwarx (Load 
Word and 
Indexed) and 
(Store Word Conditional Index) are used 


Reserve 


StWCX. 


to implement memory synchronization. 
lwarx performs a load and sets a reserva- 
tion bit internal to the processor and hid- 
den from the programming model. The 
associated store instruction stwcx. per- 
forms a conditional store only if the reser- 
vation bit is set and thereafter clears the 
reservation bit. CR[CRO]EQ is set to the 
state of the reservation bit at the start of 
the instruction so that software can deter- 


mine if the write was successful. 
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Synchronization Instructions 


Commonly misunderstood PowerPC 
instructions are those that perform synchro- 


nization. These instructions include: 


¢ Enforce In/Order Execution of I/O (eieio) 
— This instruction is for data accesses to 
guarantee that loads and stores complete 
with respect to one another. Since 
PowerPC defines a weakly ordered storage 
model in which loads and stores can com- 
plete out of order, this instruction exists to 


guarantee ordering where necessary. 





e Synchronize (sync) — This instruction guar- 


antees that the preceding instructions com- 
plete before the sync completes. This 
instruction is useful for guaranteeing 
load/store access completion. For example, 
a sync may be used when writing memory 
mapped I/O registers to a slow device 
before making further access to the device. 


e Instruction Synchronize (isync) — This 
instruction provides ordering for all effects 
of all instructions executed by the proces- 


sor. It is used to synchronize the instruction 


context, such as memory translation, endi- 
anness, cache coherency, etc. Instruction 
pipelines are flushed when an isync is per- 
formed, and the next instruction is fetched 


in the new context. This instruction is use- 


ful for self-modifying code. 


Memory Management 


Memory management is used to translate 
logical (effective) addresses to physical (real) 
addresses. Memory management units 
(MMUs) are also used to control storage 
attributes, such as cacheability, cache write- 
though/write-back mode, memory coheren- 
cy, and guardedness. There are 
two primary approaches; one 
defined by PowerPC classic in 
the 600/700 family of proces- 
sors and another used by the 
400 family and Book E specifi- 
cation. In both cases, the archi- 
tecture defines a_ unified 
MMU, which has traditionally 
been implemented as inde- 
pendent instruction and data 
MMwUs, enabled via the MSR 
[IR,DR] _ bits, 


Below is an overview of the 


respectively. 


two approaches. 
PowerPC Classic MMU 


The PowerPC Classic MMU 
was designed primarily for 
demand page operating sys- 
tems such as UNIX or 
MacOS. There are two transla- 
tion mechanisms, one for 
block address translation, and 
another for page tables. Block 
address translation is per- 
formed using eight pairs (upper and lower) 
of address translation registers, four for 
instruction addresses (IBATU/L 0-3), and 
four for data accesses (DBATU/L 0-3). The 
BAT registers define page sizes ranging from 
128KB to 16MB. 


For systems requiring more translations than 
are found in the allocated BAT registers, page 
table translation is provided. A 32-bit effec- 
tive address is translated to a 52-bit virtual 
address, and is then translated into a physical 


address. One of 16 segment registers (SRO- 
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SR15) provide virtual address and protection 
information. Page Table Entries (PTEs) pro- 
vided physical address and page protection 
information. The architecture allows for 
implementations to provide translation- 
lookaside buffers (TLBs) to speed the trans- 
lation process, but does not define them. The 
page-tables are typically programmed by the 
Operating system and their discussion 1s 
beyond the scope of this article. 


400 Family/Book EMMU 


The Book E carries on the idea of a flexible 
MMU structure for embedded systems. 
Page sizes are programmable; a page can be 
large (up to a terabyte in the Book E archi- 
tecture) to simplify software and minimize 
the number of entries, or as small as 1KB, 
to avoid wasting memory space. In addi- 
tion to normal protection and translation 
mechanisms, endianness is defined by a 
page attribute. TLB misses result in an 
exception; it is under software control to 
handle the page miss algorithm. A TLB 
search instruction, tlbsx, assists in search- 


ing the entire LB array in a single cycle. 
Interrupts 


The PowerPC architecture provides a 
minimal hardware scheme for saving state 
on interrupts. The only registers that are 
saved are the IAR and MSR. Interrupt 
enable bits are disabled for the interrupt 
type that occurred in order to prevent a 
second interrupt from occurring before 
saving the context. Software must save all 
necessary registers — these typically 
include all user-mode registers and possi- 
SPRs. 


Exception-state saving is typically per- 


bly certain supervisor mode 
formed by an operating system, but note 
that for small exception vectors, time can 
be saved by only saving registers that 
would otherwise be corrupted. Operating 
systems must take a more universal 
approach and save all registers that may be 
necessary, even if some wind up not being 


touched by a particular exception handler. 


PowerPC Classic Exception 


Vector Processing 


A single interrupt hierarchy is defined. 


When an interrupt occurs, Save/Restore 
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Register 0 (SRRO) is loaded with the 
address of where processing should resume 
after the exception, and the machine state 
register is saved to SRR1. SRRO may be 
loaded with the current IAR or in some 
cases the next instruction. Interrupt vec- 
tors are located at either a high address 
(OxFFFn_nnnn if MSR[IP=1] or low 
address (0Ox000n_nnnn if MSR[IP=0]), 
depending on the instruction prefix bit in 
the MSR. The interrupt type determines 
the lower 5 bits of the vector. When pro- 
cessing is completed, an rfi instruction is 
executed to restore the IAR and MSR to 
the saved values in SRRO and SRRI. 


400 Family and Book E Exception 


Vector Processing 


Both the IBM 400 family and Book E 
define a two-level interrupt hierarchy: a 
non-critical interrupt class, and a critical 
interrupt class. The non-critical class reg- 
isters work as previously described for 
PowerPC classic. For critical interrupts, 
the IAR and MSR are saved to separate 
registers (SRR2 & SRR3, respectively for 
the 400 family, and CSSRO & CSSRI1 for 
Book E). When a critical exception is 
completed, an rfci instruction is executed 
to properly restore the machine. By hav- 
ing a dual-level interrupt scheme, 
non-critical interrupts can be more 


easily debugged. More than two sets of 


Instruction Mode Implementation 


User All 
Supervisor All 
User All 
User All 
User All 


Supervisor IBM 4xx 
User Ayx / Book E 


Supervisor IBM 4xx 


Table 4 - Cache management instructions 
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interrupt vectors are possible — for the 
400 family, the upper 16 bits of the 
exception vector is contained in the 
Exception Vector Prefix Register (EVPR). 
For Book E, all 16 exceptions can have 
the upper half of the exception vector 
mapped to a different location through 
the use of 16 Interrupt Vector Prefix 
Registers (IVPRO-15). 


Stack 


The PowerPC architecture has no notion of 
a stack for local storage. There are no push 
or pop instructions and no dedicated stack 
pointer register defined by the architecture. 
However, there is a software standard used 
for C/C++ programs called the Embedded 
Application Binary Interface (EABI) which 
defines register and memory conventions 
for a stack. The EABI reserves GPR1 for a 
stack pointer, GPR3-GPR7 for function 
argument passing and GPR3 for function 
return values. Assembly language programs 
wishing to interface to C/C++ code must 
follow the same standards to preserve the 


conventions. 
Caches 


The PowerPC architecture contains cache 
management instructions for both user- 
level and supervisor-level cache accesses. 


Cache management instructions are found 


in Table 4 below. 


Function 


Instruction Cache Congruence Class Invalidate 
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Care should be taken when porting cache 
different 
implementation. Although 


manipulation code to a 


PowerPC 
cache instructions may be common across 
different implementations, cache organi- 
zation and size may likely change. For 
example, code that makes assumptions 
about the cache size to perform a flush 
may need to be modified for other cache 
sizes. Also, cache initialization may vary 
between implementations. Some provide 
hardware to automatically clear cache 
tags, while others require software loop- 


ing to invalidate cache tags. 
Self-Modifying Code 


While it is not a recommended practice to 
write self-modifying code, sometimes it is 
absolutely necessary. The following 
sequence shows the instructions used to 


perform a code modification: 
1. Store modified instruction. 


2. Issue dcbst instruction to force new 


instruction to main store. 


3. Issue sync instruction to ensure DCBST 


is completed. 


4.Issue icbi instruction to invalidate 


instruction cache line. 


5. Issue isyne instruction to clear instruc- 
tion pipeline. 

6. It is now OK to execute the modified 
instruction. 


Timers 


Most implementations have provided a 64- 
bit timebase that is readable via two 32-bit 


registers. [he amount the timer increments 


varies across families, as well as the SPR 


numbers and instructions to access the 
timebase. Therefore, care should be taken 
when porting timer code across implemen- 
tations. Additional timers may also vary, 
but most provide at least one kind of decre- 


menting programmable timer. 
Book E Timers 
Both the IBM 400 family and Book E 


define the following timers in addition to 
the timebase: a 32-bit programmable 


decrementer (DEC in Book E, PIT for tions. These are typically things like 
a 


Prin 2.00 2a 
i a 
= —<— : 
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the 400 family) with an auto-reload 
capability, a fixed-interval timer (FIT), 
and a watchdog timer (WDT) for system 


hang conditions. 
Debug Facilities 


Debug facilities vary greatly between 
implementations. Original PowerPC 600 
family parts had only one instruction 
address breakpoint. PowerPC 700 family 
parts have added a single data address 
breakpoint. PowerPC 400 family parts 
have much more robust debug capabilities, 
including multiple 


address 


breakpoints, data 


instruction 


address breakpoints, 
and data value com- 
pares. Other features 
may include break- 
point sequencing, 
counters, ranges, and 


trace capabilities. 


Maintaining Code 
Compatibility 


PowerPC users who 
expect to program for 
more than one imple- 
mentation typically 
ask for tips on main- 
taining code compat- 
ibility. The following 
are some suggestions 
to help minimize 


porting problems: 


e Use C code when- 
ever possible. 
Todays C compil- 
ers can produce 
code that is compa- 
rable in perform- 


hand- 


assembly coding in 


ance to 


many cases. C code, being Book I code, 
will guarantee code portability. 


e Also, try not to embed processor-specif- 
ic assembly instructions in C, as they'll 
be harder to find. Separate processor- 


specific code that is known to contain 


boot up sequences and device drivers, 
but also may include floating point 
code (including long long types). Keep 
them well documented as to assump- 


tions and dependencies. 


Use the PVR, but only when appropri- 
ate. Common code across minor varia- 
tions of implementations is good, and 
the PVR can be used for decision mak- 
ing. But in the case where major modifi- 
cations are necessary (for example, 7xx 
versus 4xx MMU code), separate code 
bases are recommended. 





Summary 


This completes an introduction to the 
PowerPC programming model. IBM hopes 
you have found this of value, and that it 
adds to the success of your development 
programs. For more information, go to: 


device dependent registers or instruc-  www-3.ibm.com/chips/products/powerpe/ 
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t's avyt what you'd expect Zu BM. 


by Kalpesh Gala and Mike Vowell 
IBM 


It is not by accident that ideal targets for 
IBM PowerPC™ technology include wired 
and wireless networking, storage, and perva- 
sive computing applications. It’s all part of a 
master strategy to focus both today’s and 
tomorrow’s industry leading technology on 
PowerPC products — to meet the perform- 
ance, power, and price needs of these ever- 


dynamic and diverse applications. 


The focus of IBM’s strategy centers upon 
both technology and design expertise. 
Underlying this focus is a strong and long- 
term commitment to leveraging the PowerPC 
architecture and designing with standard 


interfaces, such as RapidIO™, PCI/X, and 


22 Xcell Journal 


Ethernet. The goal of this strategy is to 
assure the continuation of a product portfo- 
lio heritage of being core-based, power-effi- 


cient, scaleable, and software transparent. 





ket opportunities evolve for higher per- 
formance and/or lower power devices, 
IBM will be poised to address these seem- 


ingly insatiable needs. 


THE RAPIDIO INTERCONNECT ARCHITECTURE, DESIGNED TO RE COMPATIBLE WITH MOST POPULAR INTEGRATED COMMUNICATION 
PROCESSORS, HOST PROCESSORS, AND NETWORKING DIGITAL SIGNAL PROCESSORS, [8 A HIGH PERFORMANCE, PACKET-SWITCHED, 
INTERCONNECT TECHNOLOGY. IT ADDRESSES THE HIGH-PERFORMANCE EMBEDDED INDUSTRY'S NEED FOR RELIABILITY, INCREASED 
BANDWIDTH, AND FASTER BUS SPEEDS IN AN INTRA‘SYSTEM INTERCONNECT. THE RAPIDID INTERCONNECT ALLOWS CHIP-T0-CHIP 
AND BOARD-T0-BOARD COMMUNICATIONS AT PERFORMANCE LEVELS SCALING TO TEN GIGABITS PER SECOND AND BEYOND. 


The melding of IBM’s advanced technol- 
ogy, PowerPC products, and SoC capabil- 
ity will establish a hallmark in the battle 


for mindshare and marketshare. As mar- 


IBM’s PowerPC cores, microprocessors, 
and integrated products meet the unique 
needs of an increasingly diverse market- 


place. PowerPC chips offer state of the art 
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technology in a variety of configurations to 
provide the optimal mix of performance, 
power, functionality, and size. And because 
of our core-based design philosophy, IBM 
customers can quickly and easily differenti- 
ate their products in the marketplace, and 
still maintain flexibility and software trans- 


parency across generations of devices. 


IBM HAS BEEN A LEADER IN AWARDED PATENTS 
FOR MANY YEARS, LARGELY AS A RESULT OF [HM 
MICROELECTRONICS CONTRIBUTIONS. AMONG THE MORE 
RECENT OF THESE CONTRIBUTIONS ARE SEVERAL DOZEN 
PATENTS DIRECTLY RELATED TO THREE CHIP BREAKTHROUGHS 
— SILICON GERMANIUM (SIGE), SILICIN-ON-INSULATOR 
(SOD, AND LOW-K DIELECTRIC. 


JUST AS TM LED THE INDUSTRY WITH ITS 
COPPER PROCESS TECHNOLOGY, THESE NEW PROCESS 
TECHNOLOGIES WILL BE THE CATALYST FOR EVEN 
GREATER PERFORMANCE AND LOW-POWER ADVANCES 
IN THE SEMICONDUCTOR INDUSTRY. 


Key to enabling a high level of flexibility is 
the IBM CoreConnect™ on-chip bus 
architecture, which is becoming a defacto 
industry standard. CoreConnect provides a 
standardized method for assembling pieces 
of chip designs from diverse suppliers to 
facilitate an open SoC design process that 
encourages the development of reusable IP. 
Currently, this bus structure is licensed by 
over 40 IP providers, and is the basis for 
numerous IBM standard, application spe- 
cific, and custom devices. Forthcoming 
enhancements include higher bandwidth 
and crossbar functionality, which will 
marry well with our next-generation CPU 
cores and our planned addition of per- 
formance-enhancing IP like RapidIO and 
high-speed serial ports. 


Enhancing IBM’s strategy is a commit- 
ment to the PowerPC architecture. The 
latest enhanced version, called PowerPC 


Book E, has been refined to provide 64- 
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bit capabilities, increase flexibility, and 
address the unique demands posed by 
embedded systems. Book E is a new defi- 
nition of the PowerPC architecture, one 
that maintains compatibility with appli- 
developed for the 


cations original 


PowerPC architecture. 
What’s technology got to do with it? 


Almost everything. Its what boosts per- 
formance... throttles power consumption... 
and enables smaller devices. In a nutshell, 
technology is a key ingredient to the ever- 
elusive faster, smaller, and cheaper semi- 


conductor solution. 


Today's copper process, silicon-on-insulator, 
and Low-K Dielectric technologies have fos- 
tered higher levels of performance and lower 
power dissipation than were previously 
attainable. This elevation of performance is 
currently being realized in IBM’s highest 
performance servers and workstations for 
data processing and electronic commerce 
through IBM’s stand-alone and system-on- 
chip (SoC) processors. 


The application of new process technolo- 
gies is crucial to the development of proces- 


sor engines that will meet the performance 


and power needs of next-generation appli- 
cations. And nowhere will the infusion of 
new technology be more appreciated than 
in the relatively new “pervasive comput- 
ing.” This includes handheld and embed- 
ded products such as smart phones and 
Internet appliances for business profession- 
als and consumers. In this arena, both 
“high performance” and “low power con- 
sumption” are of utmost importance. To 
continue meeting this challenge, IBM is 
focusing its technology resources on 
PowerPC embedded processors that will 
distinguish themselves by their enviable 
power/performance ratios as well as their 
high integration of critical IP. 


Conclusion 


As a chief architect of the PowerPC 
Architecture, IBM has been at the forefront 
in the evolution of microprocessor design, 
and SoC 
advances in the industry. IBM likes its posi- 


semiconductor technology, 


tion as a pioneer and industry leader — and 
has no desire to relinquish its position. You 
are invited to meet IBM at any of its mile- 
stones, as it advances toward an ultrascalar, 
2+ GHz-processor engine. It’s exactly what 


you would expect from IBM. 


IBM PowerPC Strategic Roadmap 
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Development Tools 


bet up to 


Multi-Gigabit 


Speed 


with the 


SPECCTRAQuest 


Design 


Kit 


Learn how fo implement Rocket 1/0 multi-gigabit 
serial transceivers in the new Virtex-Il Pro Plattorm FPGA. 


by Donald Telian 
Technologist 

Cadence Design Systems 
donaldt@cadence.com 


Mark Alexander 

Product Application Engineer 
Xilinx, Inc. 
mark.alexander@xilinx.com 


With the introduction of the new Virtex-II 
Pro™ FPGAs, high-speed Rocket I/O™ 
serial transceivers are ready to find their 
way into hundreds of new applications. Are 
you ready? What used to be confined to a 
few exotic chips, laden with pages and 
pages of design and implementation guide- 
lines, is now available to everyone. Xilinx 
has made it possible to integrate multi- 
gigabit transceivers (MGTs) into your 
FPGA, but how will you integrate that 
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high-speed FPGA into your system of 
printed circuit boards (PCBs)? 


In an effort to avoid multi-gigabit 
headaches, Xilinx and Cadence Design 
Systems have assembled 
5 Pie, C Pree uest !™ 
MGT Design Kits to 
help you implement the 
new Rocket I/O MGTs 
effectively in your sys- 
tem. Whether you need to 
craft a custom MGT interface 
and develop your own 
PCB/backplane/cabling guide- 
lines, or you simply need to 
apply your MGTs in a stan- 
dard configuration, the kit 
gets you moving towards a 


solution within minutes. 


Figure I - Example of 
chip-to-chip simulation topology 





SPECCTRAQuest MGT Design Kit 


Pre-configured circuits in the design kits are 
ready to simulate for both typical MGT 
chip-to-chip and backplane PCB interfaces. 
There's no time wasted hunting for models, 
testing and correlating them, or figuring out 
how to connect them together. It has all been 
done for you. And because the simulation 
environment is graphical, adapting the cir- 
cuits for your unique application is as simple 


as dragging and dropping. 


Better yet, the models of the active Rocket 
I/O MGT circuitry are transistor-level sili- 
con models that have been correlated by 
Xilinx to match both the actual silicon 
design and empirical data. This ensures that 
your system implementation is designed 
with the most advanced and accurate mod- 
els available. Add to that fully coupled fre- 
quency-dependent lossy package, PCB 
trace, and connector models, and youre 
ready to carefully characterize the signal 
integrity and degradation issues that are 


inherent in this type of design. 


Figure 1 shows a pre-configured simulation 
drawing for chip-to-chip applications. 
Although this is actually a 500+ node simu- 
lation, it has been simplified through the use 
of subcircuits and black box models so it can 
be more easily modified. Just point and click 
to change trace parameters, physical connec- 


tions, or other aspects of the circuit. 


When your simulations are done and it’s 
time to layout your PCB, the MGT kit has 
sample footprints and constraint files to 
ensure a successful layout. It is essential to 

bind all high-speed constraints into your 
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Figure 2 - High-speed PCB design flow 


PCB database to automate and properly con- 
strain the layout process. If you dont bind 
your constraints, your layout, post-layout 
simulation, and actual system behavior will 


not be as clean as you would want. 


To help you along the learning curve associ- 
ated with designing multi-gigabit links, the 
kits also contain tutorial “movies” you can 
run on your PC to clearly illustrate how to 
use the various aspects of the kit throughout 


your design process. 


SPECCTRAQuest’s Features 
SPECCTRAQuest is an integrated 


design tool that allows you to include 
both your MGT silicon models and 
PCB databases in one simulation. You 
can easily set up extensive “sweep” sim- 
ulations to sweep circuit variables 
through a range of values to test out dif- 


ferent implementation options. 


Simulation of trace parameter varia- 
tions (such as loss tangents, skin effect, 
dielectric constant, and other varia- 
tions), crosstalk, power noise, deter- 
ministic/random jitter, and other effects 


are all included. 


Stimulate your circuit with pre-coded 
8b/10b pseudo-random bit sequence 
(PRBS) patterns to create inter-symbol 
interference (ISI) and study its effect on sig- 
nal integrity. With the push of a button, you 
can plot the resulting waveform as an eye 
diagram so you can quickly quantify the sig- 
nal degradation from transmitter to receiver. 
SPECCTRAQuest also provides automated 


measurements of each circuit’s performance. 


Spring 2002 


Constraint- 
Driven 
High-Speed 
PCB Layout 


Post-Route 
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Manufacture 
Assemble 
Test PCB 


These measurements can be sorted so you 
can quickly select the best configuration. 


After you have determined the best 
implementation configuration, you can 
use Cadence’s powerful constraint man- 
agement (CM) tool to electronically cap- 
ture and manage your layout constraints. 
The CM tool interacts with the other 
Connected Team Design Tools (including 
Concept HDL; Allegra SEE @@ ims 
and SPECCTRAQuest) to enable your 
whole design team to work concurrently 
and share data. 
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Figure 3 - Simulation from layout and extracting a differential net 


SPECCTRAQuest has a unique way of 
superimposing advanced simulation on top 
of high-speed PCB layout. For more infor- 
mation on the features available in 
SPECCTRAQuest and other Cadence PCB 
tools, please refer to www.specctraquest.com 
and pcb.cadence.com. 


MGT Design & Implementation Flow 


With both the SPECCTRAQuest software 
and MGT design kit in place, you're ready to 
exercise the advanced, high-speed design flow 
shown in Figure 2 to ensure your MGTs are 


implemented correctly. 


The flow begins by configuring a simulation 
of your unique application. Use the pre- 
configured topologies in the design kit as a 
starting point. Simulate your application 
pre-route by sweeping through all the imple- 
mentation options and manufacturing 
tolerances to ensure that the multi-gigabit 
signals are transmitted and received within 
tolerances. Use the test patterns and post-pro- 
cessing tools in the kit to help with this phase. 


Once an acceptable solution is found, cap- 
ture the layout constraints in electronic 
topologies that can be bound into the layout 
process. Once again, you can use the kit 
examples to learn how. Perform constraint- 
driven layout, and then do post-layout sim- 
ulation on the routed nets to verify that 
signal transmission is still within tolerance. 
If the layout constraints were applied and 
followed, this will be a simple verification 
step. If a problem is found, the differential 
net can be extracted to an electrical view to 
more easily identify the cause of failure, as 


shown in Figure 3. 
Conclusion 


Although using MGTs in your 
high-speed design can bring high- 
er performance to your FPGA 
application, you must be careful 
in performing the implementa- 
tion. By correctly using the 
SPECCTRAQuest MGT Design 
Kit assembled for this task, you 
can avoid common problems and 
shorten your design cycle. The kit 
includes layout guidelines, con- 
straints, topologies, scripts, and 
utilities that will help you inte- 
grate Rocket I/O MGTs into your high-speed 
Virtex-II Pro designs. 


The MGT design kit is available for free 
download from the Spice Suite at www.xil- 
inx.com. Registration is required if you are 


not already a Xilinx customer 
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by Jay Gould 


Embedded Partners Program Manager 


Xilinx, Inc. 
= = jay.gould@xilinx.com 
Xilinx Platform FPGAs introduce a 


completely flexible and programmable 


platform for creating custom and 
upgradeable embedded _ solutions. 
However, using high-performance 
processor cores can be a challenge. To 
maximize your productivity, you need 


best-of-class embedded software devel- 


opment tools. 


_* ,  Time-to-market pressures and product 

sien | differentiation goals often create con- 
‘— . flicts with engineering schedules. That’s 
he — why more companies are moving away 


from home-grown tool environments 





and their associated support problems. 


In this competitive climate, it’s usually 


better for you to use the best, commer- 
cially available tools — and focus your 
engineering resources on unique, value- 


added product design. 


Virtex-Il Pro Requires 


A compiler tuned tor a specitic Embedded Software Tools 
() () S processor architecture automatically The term “embedded software tools 
produces tight, fast code. r ena! = | most often applies to the tools required 


to create, edit, compile, link, load, and 
debug high-level language code (usually 
C/C++) for execution on a processor 
engine. The processor could be hard or 
soft; 8-, 16-, 32-, or 64-bit; high or low 
performance, and so on, but the basic 
development flow is generally the same. 
With Virtex-II Pro™, you can target 
design modules for either silicon hard- 
ware in FPGA logic gates, or as software 
applications run on processor engines 
like the embedded IBM PPC405 hard 
core. Since hardware engineers, software 
engineers, firmware engineers, system 
architects, and others may all target the 
Virtex-II Pro, Xilinx has a “market 
leader” tools strategy to appeal to these 
different camps. This strategy also has 
the added advantage of appealing to the 


largest installed base of embedded users. 
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Xilinx could have created new embedded 
software tools from scratch, (in fact, we 
employ more software engineers than most 
“software” companies), but instead Xilinx 
chose to launch the Virtex-II Pro with 
“Xilinx versions” of established third-party 
tools. Therefore you don’t have to embrace 
completely new development methodolo- 
gies and you can port existing designs into 
the Virtex-II Pro fabric. With tens of thou- 
sands of engineers already using these tools, 
using a complementary model is far more 


constructive than creating a new technology. 


For an embedded processor core to run 
machine code, the algorithm must be 
entered in an HLL (high level lan- 
guage) such as C or C++ rather than 

in an HDL (hardware description 
language). A compiler is used to 
translate that HLL design into 
specific (PPC405) binary code 

that can be executed on that 


particular processor. 


Bad compilers produce 
poorly optimized, bloat- 
ed code that takes up a 
lot of memory and 
runs slowly. A best- 
of-class compiler is 

one which has been 


tuned for a specific Figure 1 - 


processor architec- Software tool flow 


ture and automati- 

cally produces tight, 

fast code. This saves memory for the 
design and spares you days of hand-opti- 
mizing, trying to manually accelerate your 
code. For the integration and debugging 
stages, a software debugger is run on a 
host computer or workstation, and con- 
trols the program execution of the embed- 
ded target over a hardware connection 
probe. With a Virtex-II Pro and the 
PPC405, this communicates directly with 
the processor core using a JTAG port. 


Partnering with Wind River Systems 


Xilinx worked with Wind River Systems, 
the market leader in the embedded software 
industry, to provide a robust set of software 
tools for targeting the PPC405 in Virtex-I] 
Pro FPGAs. Wind River Systems has a 
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commanding market share with their broad 
range of development tools, Real Time 
Operating Systems (RTOSs), and middle- 
ware solutions. A specific Virtex-II Pro ver- 
sion of the Wind River tools (compiler, 
software debugger, and JTAG run control 
hardware probe) have been created for 
Xilinx distribution via an OEM agreement. 


The Wind River tools are optimized for 
the PPC405. The Virtex-II Pro versions of 
these tools are created with some basic 
design size limitations so that Xilinx can 


package a lower-price entry point tool 


suite through the Xilinx sales channel. You 













can buy the unrestricted versions (or 


upgrades) from the Wind River sales 
channel. Wind River already supports the 
PPC405 and will also offer numerous 
other embedded tools. 


Wind River makes numerous tools them- 
selves and have a huge partnership pro- 
gram where third parties build other tools 
to integrate within the Wind River 
Tornado IDE (integrated development 
environment). This opens the market to 
numerous other software tool suppliers 
who can specialize in niche areas, provid- 
ing extra value to end users. Wind River 
owns multiple Real Time Operating 
Systems (RTOSs), but their main market 
focus is on an RTOS product called 


VxWorks, which is also the market leader. 


The Wind River OEM tools for Virtex-II 
Pro and PPC405 include: 


¢ Diab XE — compiler 
e SingleStep XE — software debugger 
¢ visionPROBE II XE — JTAG run control 


hardware connection probe 
“XE” stands for “Xilinx Edition’. 


Optimized Compilers Provide 
High-Performance Code 


The Diab C/C++ Compiler consistently 
scores high in the 
annual market research 
results for best compil- 
ers. The fact that Diab 
is highly optimized for 

the PowerPC and _ has 
long been a market 
leader made it a perfect 
fit for the Virtex-II Pro 
and embedded PPC405 


core. 


A compiler turns high 
level language into 
machine executable code 
for a processor. Providing 
fine grained control of the 
compiler options, Diab is a 
full featured product which 
allows you to balance speed, 
code size, and memory usage 
for your applications. At the 
front end of the Diab product is a lan- 
guage parser which creates a language- 
independent representation of your code, 
and this unleashes the power of the five 
different back-end optimizer stages of the 
compiler. Diab provides optimization for 
global, code selector, code generator, peep- 
hole, and instruction scheduler stages as 
well as supporting architecture-specific 
features such as the SPRs/DCRs on the 
IBM 405 processor. 


Both the Diab Compiler and the 
SingleStep Software Debugger are consid- 
ered “best of class” products in the PPC 
embedded domain. The Xilinx PowerPC 
OEM software product is based on these 
top-of-the-line products. 
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Debugging Embedded 
Software on 

the Real Target 
Hardware 


SingleStep is a multi- 
windowed and full fea- 
tured embedded soft- 
ware debugging tool 


= Jet Gl 


“epaneaa 
that is far superior to 
command line tools. 
This product is ideally 
suited for board/hard- 


ware development, driv- 


5 
Ga Bae 

¥ = on 

oo en ol onl 


develop- 


er/firmware 


= oad 


ment, and_ software 


El i te 
hae 


application debugging 
(thus the 


requirements from two 


mn 


reducing 


am al ER Real 
a 


or more possible tools to 


just one). Software 
debuggers need to pro- 
vide basic “run, start, 
stop” control to debug an embedded system, 
but SingleStep exceeds these expectations as 
one of the most feature rich embedded 


debuggers available. 


SingleStep supports real-time target con- 
trol, high-speed downloads, built-in hard- 
ware diagnostics and Flash memory pro- 
gramming. It provides a unique processor- 
specific register interface to enable config- 
uring and initializing integrated peripher- 
als, and a command line interface with 


scripting language to automate testing. 


This debugger provides superior PPC405 
support through hardware breakpoints — 
including four hardware instruction 
address, two data instruction address and 
two data value breakpoints — and a register 
window. In addition to on-chip trace, 
SingleStep provides visibility for instruc- 
tion completion, branch taken, exception 
taken, data address compare, and other 


debug events. 


Connect Software Tools to 
the Embedded Hardware 


To download new FPGA images or run the 


host debug software tools on the real 
embedded system, visionPROBE I pro- 


vides a high-speed parallel connection 
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Figure 2 - SingleStep software debugger 


between the SingleStep debugger on the 
host and the Virtex-II Pro target. This 
product allows JTAG run control of the 
target for system debugging and, via hard- 
ware caching logic, the visionPROBE II 
can execute high-speed downloads up to 


400 KBps. 


The probe facilitates register initialization, 
hardware diagnostics, Flash programming, 
and hardware breakpoints. Additionally, 
this one probe allows for FPGA 

image downloads and 
Chipscope!™ 
Operation on 
the same target 
without hav- 
ing to add or 

change cables. 


RTOS Validation 


Figure 3 - 
vistonPROBE IT 


Xilinx will provide some validation sup- 
port for Real Time Operating System 
(RTOS) environments. Depending on the 
complexity of your application, an embed- 
ded system design may or may not require 
a hard-real-time operating system. If you 
are experimenting with a gate-consuming 
protocol or algorithm, some simple C/C++ 
code targeted for a Xilinx MicroBlaze™ 


soft processor may be all that is required. 





For other more complex 
applications, which have 
tight requirements for 
fast interrupts, maximum 
uptime and minimum laten- 


cy, the PPC405 and a robust 
RTOS may be required. 


Wind River — VxWorks 


Due to the close partnership 
with Wind River, Xilinx will 
validate that the Virtex-II 
Pro development boards, 
and our tools, will work 
seamlessly with the Wind 
River VxWorks RTOS. In 
fact, the entire support of the 
Virtex-II Pro will be validat- 
ed in the Wind River envi- 
ronment, including the cer- 
tification of a Xilinx “Board 
(BSP) 


that matches with our reference board. 


Support Package” 


The certification is a formal process 
that is 
executed 


| | by Wind 


River and through which they publicly 
acknowledge a working configuration of 


our tools, boards, and devices. 
Conclusion 


If full programmable 
embedded 
design with 
Virtex-Il 


what you are after, 


system 
the 


Pro 1s 


adding the software 
development flow to 
your FPGA logic tools is the 

next step. Rather than create a new 
methodology and introduce a new tool 
suite to a mature market, Xilinx has cho- 
sen to partner with the market leader to 
access the best-of-class tools for the 
PowerPC. Proven and established tools 
allow you to focus on adding value to 
your product design, without learning the 
nuances of immature or un-optimized 
Xilinx 


solutions at: 


tools. Read more about the 
embedded 


www.xilinx.com/processor 


software 
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SE 4.2i Expands 
High-Powered 
Design Productivity 


SE 4.2i extends ifs coverage to new Xilinx logic devices 


and even greater design tool productivity. 
A. Wy 


by Lee Hansen 

Software Product Marketing Manager 
Xilinx, Inc. 

lee.hansen@yilinx.com 


In February, Xilinx released Integrated 
Software Environment version 4.21 logic 
design software. ISE 4.21 delivers several 
new capabilities that make it the most 
complete package available for program- 
mable logic design. Building on the 
advances of ISE 4.1i, released this past 


August, version 4.2i offers several advan- 
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tages in particular to designers looking at 
embedded logic systems and high-speed 


signal challenges: 

¢ Linux ISE 

e Expanded device support 

e High-speed design support 

e Partial reconfigurability 

e Streamlining XPower performance 


¢ MXE-II 


Linux ISE 


ISE version 4.21 is the first version of Xilinx 
design software to run on the Linux operat- 
ing system. Now, you can run Xilinx imple- 
mentation tools on Linux Red Hat version 
7.2, including the Wine windows applica- 
tion layer. With the addition of the Linux 
operating system, you now have one of the 
widest choices possible in design system 
platforms and implementation tools for 


programmable logic design. 
Expanded Device Support 


ISE 4.21 includes the latest device support 
for all of the Xilinx families, including the 


new Virtex-II Pro™ product line. 


¢ The Virtex-II Pro Platform FPGA is the 
only programmable device available with 
3.125 Gbps serial I/O — and anywhere 
from zero to four IBM PowerPC™ 405 
embedded in the 


microprocessors 


device fabric. 


eISE 4.2i comes ready to implement 
Virtex-II Pro functions, as well as those 
available in the recently announced 
CoolRunner™-II and Spartan™-IIE 


device families. 


eISE 4.21 also includes the new “-6” 
speed grade for the industry's fastest 
Virtex-II] FPGAs. 


¢ By delivering FPGA and CPLD device 
support from a single design product 
line, ISE gives you an easy way to move 
up or down device families without hav- 


ing to load or relearn new software. 
High-Speed Design Support 


With Virtex-II Pro Platform FPGAs, you 
can now drive design I/O at speeds up to 
3.125 Gigabits per second — a first in pro- 
grammable logic. And ISE 4.21 comes 
ready to help you realize that high-speed 
I/O potential quickly and easily. All of the 
implementation tools, the pin planner, 
and timing and constraints editors in ISE 
4.2i have been enhanced to simplify the 
implementation of the high-speed I/O of 
the Virtex-II Pro devices. And the synthe- 
sis tools can optimize the paths to and 
transceivers 


from the multi-gigabit 
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(MGTs), a capability unique to ISE 4.21 


design software. 


The ISE 4.2i libraries also support instanti- 
ation of 1, 2, and 4-byte flavors of the high- 
speed I/O protocols supported by Xilinx. 


Xilinx and Cadence Design Systems have 
jointly developed high-speed design kits. 
These kits include software components 
and design aids to help you use the Virtex- 
II Pro multi-gigabit transceivers. The kits 
also contain HSpice compatible models for 
analyzing PCB electrical trace effects com- 
ing off the high-speed Virtex-II Pro I/O 
pins. By combining the new HSpice models 
for the Virtex-II Pro MGTs, IBIS models 
for all our FPGA device families 
and SWIFT models describing the 


while the remainder of the FPGA contin- 
ues to run. Products can be updated for 
new functionality while still in the field, 
and the product doesn’t have to be taken 


offline while new functionality is loaded. 
Streamlining XPower Performance 


XPower, an ISE feature that estimates the 
power consumption of FPGAs and CPLDs, 
has also been streamlined for better per- 
formance. VCD simulation files now read in 
much faster than previous versions, allowing 
for quick and accurate setup of input signal 
activity. And XPower now supports the new 
CoolRunner-II family of CPLDs. 


lation capacity and performance needed to 
verify the ever-increasing repertoire of 
faster and denser programmable devices. 
These additional capabilities are user-trans- 
parent, requiring no time-consuming 
learning curves. In fact, MXE-II simula- 
tion software requires less from the end 
user because of the integration with [SE’s 
HDL Bencher™ graphical test bench gen- 
eration environment. HDL Bencher wave- 
forms are automatically translated into 
VHDL or Verilog, simulated with MXE- 
TT, and (ieilexpeetccusrestiliwmeanmne 
back-annotated into the original wave- 
forms. Discrepancies between expected 

and actual results are also high- 

lighted, expediting dynamic 


verification. 


wreirarne | ¥o-0| Operating Consinara | Loans | Actertea | vo [Done] 
— Moreover, MXE-II includes 


optimized libraries that deliver 


behavior of the PowerPC™ 


microprocessor and multi-gigabit 


aoe rates. 


/Cie_BLP OPA iF 


transceivers, Xilinx provides sup- 


faster simulation runtimes 


port for analyzing your FPGA, 


even after its been programmed 


and ready for the board. 
Partial Reconfigurability 


Continuing its long string of firsts 
in the programmable logic indus- 
try, Xilinx has added a new capa- 
bility to both the ISE software 
family, as well as the Virtex-E and 
Virtex-II product lines — Partial 
Reconfigurability. Introduced in 
ISE 4.21, Partial Reconfigurability 
allows either a Virtex-II or Virtex-E FPGA 
to be partitioned so that part of the device 
can be reprogrammed, while the remainder 
of the FPGA continues to run. 


Partial Reconfigurability works through 
the Modular Design option for ISE. Using 
Modular Design, the overall design can be 
partitioned into sub-modules that can be 
implemented independently of each other. 
Once a module is completed, its timing 
and performance are locked down while 
the remaining modules are being finished, 


delivering faster overall design completion. 


With the addition of Partial 
Reconfigurability, the device can also be 
partitioned where electronic functions 
make the most sense. Then later on in the 


field, one partition can be reprogrammed 
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Figure 1 - XPower New Design Wizard 


XPower New Design Wizard 


Most important, in the ISE 4.21 version of 
XPower you will also find the “XPower 
New Design Wizard,” shown in Figure 1. 
The wizard helps XPower users read in 
design data, input VCD files if available, 
set default parameters, and identify and 
then set specific input activity rates — all in 
a tab-delineated form that’s easy and intu- 
itive to use. This new wizard helps you get 


more accurate thermal estimates faster. 
MXE-II Simulation Software 
ModelSim™ Xilinx Edition I] (MXE-II) 


simulation and debug software is now 
available to all ISE customers. Based on 
Model ‘Technology’s ModelSim 5.5e soft- 
ware, MXE-II delivers the additional simu- 


than competing technologies 
for post-route non-timing/tim- 
ing simulation. These libraries 
are available for all the device 


families supported in ISE. 





These enhancements — coupled 
with a new, fully automated 
licensing process — significant- 
ly expand MXE-II’s usefulness 
to a broader set of PC-based 


customers. 


MXE-II performs 33% better 
than its predecessors. That, plus the dou- 
bling of MXE-II’s capacity, makes it ideal 
for the verification of all types of Xilinx 
devices, including the CoolRunner-I], 
Spartan-IIE, and Virtex-I families. 


Conclusion 


ISE 4.21 continues to deliver the speed 
you need — from the simplest designs in 
the smallest device families to the most 
demanding high-speed, embedded sys- 
tem designs. Upgrade today to get the 
most leading edge capabilities available 
for your programmable logic products. 
You can find out more about ISE 4.21 at 
www.xilinx.com/ise/421. For a recorded 
e-learning lecture on ISE 4.21, go to 
www.xilinx.com/support/training/ 


north-america-home-page. htm. 
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by Frank L. Toth P 
Marketing Manager, Configuration Solutions Division 
Xilinx Inc. 

frank. toth@xilinx.com 


Xilinx offers a variety of System ACE™ 
configuration options to meet a wide 
range of configuration speeds, densities 
(bitstream size), and costs, as shown in 
Table 1. These pre-engineered solutions 
simplify the configuration of our FPGAs 
and help you get your design to market as 
quickly as possible. 


The System ACE CF solution, shown in 
Figure 1, leverages the tremendous density 


of commercial compact flash memory 






Multiple Designs 
S/W Storage 

Removable 

IRL Hooks 

Density it - 


Performance 
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#Components 
Min. Board Space 
Space /bit 





32-256 Mbit/sq. in. 


Table 1 - System ACE configuration options 
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devices to give you the Ditkefit of industry- 
wide increases in speed and density. When 
power is applied, the System ACE CF solu- 
tion uses the JTAG port to configure the 
Virtex-II Pro™ fabric and the embedded 
processor, as shown in Figure 2. Virtex-Il 
Pro FPGAs can also be easily accessed by 
the Xilinx ChipScope™ debugging tools 
using this same JTAG port. 


The System ACE CF solution uses the 
industry standard FAT file management 
system, which enables you to manage 
both bitstream and software files like disk 
space. Multiple configuration bitstreams 


can be selected and managed under soft- 
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Figure I - 
System ACE CF 


overview 
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ware and hardware control. Built In Self 
Test (BIST) files at the system level can be 
loaded and used to reconfigure the FPGAs 
to perform system-level I/O, network 
communication, functionality, and mem- 
ory tests. Then the FPGA can be reconfig- 


ured for the mission mode. 
Built-in Configuration Controller 


In a system with a Virtex-II Pro device along 
with other Virtex™ and Spartan™-II 
FPGAs, the Virtex-II Pro device may be 
configured first. Then the embedded 
processor can be used as a configuration 
controller eliminating the need for a sepa- 


rate processor. The Virtex-I] Pro FPGA 


Automatic Test Equipment 
JTAG Test Tools ei _ 
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can then be reconfigured allowing one 
processor to be used for configuration, 
system test, and the mission mode. 
Because the processor and the intercon- 
nect fabric of Virtex-II Pro FPGAs can 


easily be programmed for multiple func- 





PowerPC 
Processor 


tions, you can use the optimal configura- 
tion mode including JTAG, Slave Serial, 
or SelectMap modes. 


As an alternative configuration sequence, the 
Virtex-IT Pro FPGA can be “booted,” as 
shown in Figure 3, using a smaller boot file 
provided by the System ACE solution on 
power up. [he embedded processor can then 
take control of the configuration process and 
in turn configure itself and the rest of the sys- 
tem by sending the appropriate interface and 


control signals to the System ACE solution. 





PowerPC 
Processor 





Other Configuration Options Figure 4 - 
System ACE 

Xilinx also offers several other pre-engi- multi-package 
module 


neered configuration solutions including 
serial PROMs, the System ACE MPM 
(Multi-Package Module, shown in Figure 4) 
and the System ACE SC_ (Soft 
Controller). The density of the System 
ACE MPM solution ranges from 16 to 64 
megabits (or more by using the System 
ACE bitstream compression software) and 
features a Slave Serial or Select Map con- 
figuration port output. The native 
flash/microprocessor interface allows the 
System ACE MPM solution to be con- 


nected to the microprocessor bus. 
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If you want to use a different form factor 
and mount the components directly on 
tho==boal dre tie System eater SSOft 
Controller IP is available. This pre-engi- 
neered solution is identical in functionali- 
ty to the System ACE MPM solution and 
you can download the IP free of charge. 


Taking Full Advantage of 
System Re-Configurability 


With the advent of the embedded proces- 
sor and the other system-level features of 
the Virtex-II Pro family, the use of multi- 
ple configurations becomes more useful, 
both to increase system flexibility and 
extend product life. Multiple configura- 
tion files can also be used as a vehicle for 
loading BIST and for testing system func- 


tionality, integrity, and performance. 
Conclusion 


With the Xilinx System ACE solutions, 
you can easily control both the configura- 
tion of your FPGAs and automatically 
load your embedded processor software. 
There is no easier way to configure your 


Virtex-II Pro designs. 


For more information on System ACE 
products, go to: www.xilinx.com/xInx/ 


xil_prodcat_product.jsp?title=system_ace 
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Want to DUMP UP 
your FPGA processing 
pertormance? 





























We can show you how we trans- 

formed a soft 32-bit microprocessor 

core running at 50 MHz ina 

Virtex-I] FPGA into a cruncher that 

can do operations that would challenge 
GHz microprocessors. 


If you want to do real-time video manipulation 

and storage while processing audio and handling the latest 
networking protocols — add “Xyronium™” technology 

to your multi-tasking embedded control environment and 
realize incredible performance gains. 


Need to do some heavy duty processing? Order one of our 
development boards today to realize your full potential. 


Find out how Xyron unleashes the untapped 
power of today’s processors. 


Visit us at Embedded Systems 
Conference — San Francisco 
Booth 639s or at “Xilinx 
Programmable World 2002”, 
or see us online at 
www.xyronsemi.com. 





See us at the Xilinx 


www.xyronsemi.com 
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The system-on-a-chip (SOC) market has 


experienced steady and _ consistent 


growth. Dataquest estimates roughly half 
of all ASIC design starts are SOC based. 
That percentage is projected to reach 
80% by 2005. There are several reasons 
for this clear shift in design methodology. 
Some of the obvious advantages include 
greater Component integration, increased 
speed (Logic <-> Processor), lower 
packaging and test costs, and 
increased overall system reliability. 
All of these combined can potential- 
ly make significant contribu- 
tions to achieving the often elu- 

sive but always important goal of 


accelerated time to market. 


Programmable logic device vendors 

have entered the SOC solution space 
with the introduction of a new class of 
devices known as Platform FPGAs — with 
the most recent addition being the Virtex- 
II PRO™., These devices offer the same 
level of integration as ASIC SOCs, but in 
contrast, Platform FPGAs facilitate the 
development of a wide range of applica- 
tions on the same chip. This article focus- 
es on seven of the key advantages of the 


Platform FPGA approach. 
#1 — Pre-Engineered Platform 


Platform FPGAs integrate several fixed 
and predetermined blocks of hard IP 
(intellectual property) components (sys- 
tem elements) within the programmable 
fabric. Notable among these are high-per- 
formance RISC CPUs, multi-gigabit and 
high speed I/Os, block RAM, system 
clock management, and dedicated DSP 
processing hardware. This powerful 
assembly and harmonious blend of com- 
ponents creates a cohesive system design 
environment. This environment offers 
unprecedented flexibility and perform- 
ance, thus enabling the deployment of a 


wide range of applications. 


The critical technological breakthrough is 
in the ability to tightly interface the vari- 
ous elements into the programmable fab- 


ric. Without this tight integration, much 
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of the speed benefits could not be realized. 
The fact that the choice of system elements 
is already made greatly simplifies the 
design and development process. A fixed 
architecture is particularly beneficial for 
software tool and IP providers in allowing 


them to deliver better value, customiza- 


tion, and architecture-optimized solutions. 





The assembly of the various hybrid IP 
blocks in an ASIC adds substantial com- 
plexity and hardship to the users’ design 
and development environment, because of 
a variety of issues relating to tool and IP 
interoperability, physical layout, timing, 
and system verification. For Platform 
FPGAs, on the other hand, it is much eas- 
ier to tailor and optimize components — 
such as silicon, software, support, and IP — 
because FPGAs represent a fixed and pre- 


engineered target. 


Summary: A fixed, pre-engineered but 
programmable FPGA solution offers a 
more productive and efficient develop- 
ment environment from both the software 


and silicon perspective. 
#2 — Process Technology 


PLD vendors have been able to extract 
great value and benefits from Moore’s Law 
and shrinking device geometries. While 
the majority of ASIC design starts are at or 
higher than 180 nm, FPGAs have raced 
ahead to bring the cost and performance 
advantages of 130 nm to its customers. 
This ability to rapidly migrate to the lead- 


ing edge of technology is essential to deliv- 
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ering the performance, capacity, and inte- 
gration that is necessary to challenge and 
displace the current established platforms 


of system design. 


For instance, FPGAs now make it entirely 
feasible to build systems of up to 2M (ASIC) 
gates with CPU(s) running at 400 MHz, 
serial I/O channels at 3.125 Gbps CLB fab- 
ric-switching at 300 MHz, and the entire 
system clocking over 150 MHz. This sur- 
passes the projected sweet spot of ASIC 
SOC designs. 


Looking at it in pure economic terms, 

the costs for a typical mask set for a 
130 nm ASIC run into the $700K+ 
range. That raises the bar for entry into 
the ASIC space, making the Platform 
FPGA an even more attractive option for a 


growing percentage of all SOC designs. 


Summary: FPGA silicon is best-in-class in 
process and engineering, thereby deliver- 


ing best-in-class system performance. 
#3 — Software Tools and Methodology 


It is well known that EDA designer pro- 
ductivity for ASICs is lagging behind 
recent silicon advances. The ability to cre- 
ate a productive design and debug environ- 
ment is absolutely essential to the success of 
any silicon platform. Therefore, software 
plays a critical role in not only making the 
platform easy to use and work with, but 
also in extracting the most performance 


and device utilization out of the silicon. 


The goal is to insulate the user from hav- 
ing to learn extraneous details about the 
platform, yet providing empowerment and 
control when and where it is needed. To 
this end, FPGA vendors have created cus- 
tomized and user-friendly processor sys- 
tem generator tools that aid in instantia- 
tion, initialization, and configuration of all 
the various system component blocks. In 
addition, these software tools automate 
otherwise manual and error-prone tasks, 
such as the interconnections among the 


processor, its peripherals, and buses. 


Design entry is engineered to tightly cou- 
ple the HW/SW domains. Such engineer- 
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Figure I - Software makes use of idle processor time. 


ing leads to not only simplified and easy- 
to-use design flows, but it ensures reliabil- 
ity and robustness from the very start by 
performing design rule and data consisten- 


cy checks across the two domains. 


Two examples of the advantages of cross- 


domain HW/SW co-design include: 


e Automatic generation of device drivers 
and header files for SW engineers once 
a particular block is instantiated by the 
HW designer 


¢ Tools to automatically populate SW 
binary code into appropriate FPGA 


memory bitstreams. 
Summary: Software sells silicon. 
#4 — Advanced Debug 


The importance of finding problems early 
cannot be overstated. Yet, up-front ASIC 
verification is extremely designer- and 


computer-resource intensive. Compared 
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to systems on a board, SOC limits visibil- 
ity into the internal nodes of the system, 
making the task of verification and debug 


more challenging than ever. 


The critical part includes the verification 
of complex interactions between the 
application software and the custom- 
designed peripheral hardware. ‘Traditional 
HDL-centric verification and debug tech- 
niques can no longer deal with the rising 


complexity of system designs. 


Consider a typical application, such as 
MPEG A/V decoding, where a large 
number of simulation cycles are required 
to complete a small sequence of frames. 
Co-verification tools in this case would 
either take an impractically long time to 
complete or validate only a mere fraction 
of the software code, falling far short of 
what it takes to find problems in the 
HW/SW interface. 


FPGAs overcome this problem in large 
part by being able to provide access to real 
or near real targets at a very early stage in 
the design cycle. Among other things, this 
means that SW engineers using FPGAs 
can quickly and easily sort out logic and 
design flaws by targeting real silicon. The 
engineers do not have to rely on ineffi- 
cient ASIC-centric techniques like co-ver- 
ification or writing stub code. Application 
software can be debugged at system speeds 
with full hardware and software register 


access and control. 


Additionally, the FPGA fabric allows for 
construction of highly customized and 
value-added cores to enable powerful real- 
time, on-chip debug capability. Some 


examples of such instrumentation include: 
¢ Logic and bus analyzer functions 
¢ Bus protocol compliance monitors 


¢ Memory buffers for debug and trace 
port data 


¢ Cross-domain triggers and breakpoints 
e Hardware run control of the CPU 
¢ HW/SW time synchronization logic. 


Furthermore, a single cable is able to per- 
form multiple functions, like debugging 
hardware, debugging software, as well as 
programming the FPGAs. This greatly 
simplifies the lab setup making it much 


easier to exploit the debug advantages. 


Summary: Platform FPGAs offer a clear- 
er, more cohesive, and overall more effec- 
tive debug strategy. Specifically, Platform 
FPGAs offer up-front silicon access along 
with unprecedented visibility and control 
of the processor and its peripherals resid- 


ing in the programmable fabric. 


#5 — Top Tier Partnerships and 
Vendor Tools Support 


In extending the concept of traditional 
programmable logic to Platform FPGAs, 
certain critical technologies have had to 
be developed or acquired. One of most 
exciting aspects brought forth by FPGA 


vendors has been to successfully forge 
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strategic partnerships with vendors hold- 
ing various system technology compo- 
nents. Driven largely by the successes of 
the FPGA business models, leading ven- 
dors have been eager to partner in fulfill- 
ing the vision of building powerful pro- 


grammable systems platforms. 


Areas of cooperation include partnerships 
in the form of cutting-edge process tech- 
nology, powerful mainstream processing 
elements — such as RISC CPU, high- 
speed I/O — and other components 
deemed of significance to a system design 
platform. IBM, Conexant Systems, Inc., 
Wind River Systems, and other high pro- 
file vendors are currently engaged in such 
strategic partnerships. What this means to 
you is there is no need for these partners 
to negotiate licensing, royalty, and inte- 
gration issues with individual vendors, 
thereby greatly reducing your engineering, 


management, and accounting overhead. 


The appeal and draw of FPGAs has 
caught the attention of independent SW 
tool vendors. Increasing numbers of ven- 
dors are able to sustain business models 
selling to FPGA customers. Several new 
vendors are setting up shop to write cus- 
tom tools to help enhance and exploit the 


unique capabilities of Platform FPGAs. 
Summary: The FPGA business model has 


attracted top-tier silicon and IP vendors to 
forge strategic partnerships to create pow- 
erful system design platforms. Software 
vendors are able to financially justify 
investment in research and development 
leading to a continuous stream of an 


increasing number of innovative solutions. 
#6 — Application Space: Co-Design Flexibility 
HW combined with 


processors on a single chip softens the 
HW/SW design boundary. By using 


hardware-assisted architectural explo- 


Programmable 


ration, designers can optimally search for 
the right HW and SW partitioning, 
which leads to the increased probability 
of being able to meet performance and 
Sequential 


area targets. computing, 


exception handling, and control func- 
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tions, for instance, programmed in HW 
could be implemented in SW running on 


the processor to save silicon. 


On the other hand, SW structures and 
algorithms — which can be broken into 
parallel, non-blocking processes — can 
achieve significant speed and data 
throughput improvements by implement- 
ing them in HW. In fact, software engi- 
neers represent a new and emerging mar- 
ket for Platform FPGAs. Aided by design 
tools, it is now easier than ever for SW 
engineers to explore concepts of software 
acceleration via HW. Both hardware and 
firmware are reprogrammable and field 
upgradeable, which enables the develop- 
ment and deployment of several product 
generations from one base. This translates 
into a much broader applicability than 
ASICs and ASSPs. 


In addition to being suitable for building 
complex embedded applications, HW engi- 
neers can utilize an otherwise idle processor 
to run relevant portions of their design 
algorithms or control logic. As Figure 1 
shows, the CPU can serve as a simple 
microcontroller running a single-threaded 
PLD vendors add value by 


providing software device drivers and 


application. 


library functions for rapid implementation 
without requiring the HW engineer to have 


detailed knowledge of SW practices. 


Summary: The Platform FPGAs provide the 
most flexible co-design platform by enabling 
dynamic HW-SW design partitioning. 


#7 — Risk Management 


When compared to Platform FPGAs, ASIC 
SOCs present a huge design risk. With 
more variables and issues to worry about, 
and with limited debug capabilities 


when mistakes are found, 


ASIC designers are 
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forced to either resolve problems subopti- 
mally in software, or in the worst case sce- 
nario, they are forced to respin the silicon at 


great cost and loss of time to market. 


Reprogrammability comes up big here. 
Programmable platforms allow early 
access to silicon. Engineers can validate 
performance and functionality in real sil- 
icon, leading to greater confidence in the 
reliability of the completed system. The 
programmability of the platform allows 
itself to be debugged and upgraded even 
after the system has been deployed. This 
helps promote and protect the time-to- 
market advantage by alleviating a large 


part of the risk. 


Summary: Programmable Platform 
FPGAs provide better control over the 
life cycle management of products by 
minimizing the cost and time penalty for 


design errors and specification changes. 
Conclusion 


In an era when SOCs continue to domi- 
nate mainstream design, Platform FPGAs 
are emerging to take a share of the spot- 
light from ASICs. While ASIC SOCs have 
and will continue to be strong in certain 
segments — like low power, small form fac- 
tor, and high-volume, cost-sensitive con- 
sumer electronic gadgets — an increasing 
range of other infrastructure applications 
in areas such as networking, telecommu- 
nications, industrial electronics, and data 
storage have compelling reasons to move 


to a programmable platform. 
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Since the beginning of the FPGA technolo- 
gy, Xilinx has pushed the boundaries of 
reconfiguration. In earlier FPGA families, it 
was only possible to reconfigure the whole 
FPGA. With the introduction of the 
Virtex™ FPGA families, it became possible 
to partially configure or partially reconfig- 
ure an FPGA. It is also now possible to 
reconfigure a remote FPGA via the Internet 
using Xilinx Internet Reconfigurable Logic 
(IRL™) technology. However, only a few 
companies and a few of all FPGA designs 
make use of IRL technology, because of the 
perception it is expensive, complicated, and 


mostly a proprietary solution. 


What if we could securely reconfigure 
FPGAs in the field simply by sending an 
e-mail message? In this article, we will 
show you just how easy and cost-effective 
that can be. 


Protocol Stack 


Xilinx IRL reconfiguration technology uses 
the same transmission protocols as everyday 
Internet e-mail: 


e TCP/IP — Transmission Control Protocol 
over Internet Protocol transports the 
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¢ SMTP — Simple Mail Transfer Protocol is 


used to deliver the messages. 


¢ POP3 — Post Office Protocol 3 retrieves 


the messages. 


Figure 1 shows a complete Internet proto- 
col stack. Each layer of the protocol stack is 
an abstraction level hiding details from 
other layers on top or below. For example, 
the network access layer does not need to 
know what kind of data it is carrying. 
Whether the data is video, voice, or other, 
it is unimportant to the network access 
layer. The only thing this layer needs to do 
is deliver the data in good quality to the 
upper layers. 


e Application Layer — This is the user 
interface layer. The POP3 and SMTP 
protocols necessary for IRL technology to 


work reside in this layer. 


¢ Transport Layer — This layer implements 
reliable, full-duplex communication over 
the Internet. TCP works in this layer. 


e Internet Layer — Addressing and routing 
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of data happens in the Internet Layer, 


where IP is implemented. 


e Network Access Layer — Also called the 
“link layer,” this is where the hardware 


interface is managed. 


e Physical Layer — This is the actual medi- 
um for communication across great dis- 
tances. Such transmission media 

include co-axial cable, phone lines, fiber 


optics, and wireless broadcasting. 
Implementation 


If you want to implement the Internet 
stack into an FPGA as hardware, it will: 


¢ Take a lot of time (including VHDL or 


Verilog design and simulation). 
¢ Require a robust FPGA. 
¢ Consume a lot of money. 


On the other hand, microcontrollers are 
good for protocol handling and can miti- 
gate the time and cost of building an IRL 


solution for the remote reconfiguration of 


FPGAs. 
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Figure 1 - Internet protocol stack 


Two microcontroller solutions are possible: 
¢ Use external microcontrollers. 


e Put a microcontroller inside a Virtex 


Platform FPGA. 


There are two ways to embed a microcon- 


troller in a Virtex device. 


¢ You can use the software MicroBlaze™ 
microcontroller in Virtex, Virtex-E, 
Virtex-II, or the new Virtex-IJ PRO™ 
Platform FPGAs. 


¢ You can buy a Virtex-II PRO Platform 
FPGA with a hard-wired IBM® 
PowerPC™ 405 microcontroller already 


onboard. 
External Microcontrollers 


Several microcontroller manufacturers have 
Internet capable components. For instance, 
two Ubicom microcontrollers — SX52BD 
and [P2022 — can be used for IRL applica- 
tions. Such external microcontrollers 
require “virtual peripherals” from Ubicom 
and some small modifications and addi- 


tions to control downloading to an FPGA. 


These small controllers and peripherals 
make the implementation of the TCP/IP, 
SMTP, and POP3 components of IRL 
technology easy and fast. Due to the small 
amount of internal memory of the micro- 
controllers, however, the Internet protocol 
stack is tuned to only perform the neces- 


sary functions. 
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Internet layer 


Network access layer 


Physical layer 


Basic Setup 


The simplest setup consists of an FPGA and 
a small controller, as shown in Figure 2. 
Here’s what happens when the system is 
powered up: 


¢ The FPGA is empty. 


e The microcontroller waits for a certain 
time until all components of the IRL 


design have reached a stable state. 


¢ Then the controller connects to the net- 
work by sending AI commands to an 
external modem through an RS-232 
device, or by sending AT commands to an 
onboard chip modem, or by other physi- 


cal implementation. 
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¢ Once the link has been set up with the 
e-mail server, the microcontroller asks if 
there is e-mail available. When there is, the 


microcontroller checks the e-mail header. 


e If the header is not of a particular type, the 
controller will delete the mail message on 


the server. 


¢ When the e-mail has the correct header 
type, the microcontroller downloads it. 
The /PROGRAM pin is toggled, the con- 
tents are serialized onto an output pin, and 


a bit clock is generated. 


¢ When the DONE pin goes High, the 
microcontroller deletes the e-mail on 


the server. 


¢ The microcontroller breaks the connec- 


tion. 


If the DONE pin does not go High after a 
period of time, the download operation is 


repeated until the DONE pin goes High. 
Although this is the simplest approach, it has 


its shortcomings: 
e An Internet connection is obligatory. 


e The server must always have mail ready for 
the application, or else the application can- 


not Start. 


e When configuration fails, no fail-safe 


recovery mechanism exists. 


e The design can go into an endless loop try- 


ing to download its configuration. 





Figure 2 - Basic design setup 
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Figure 3 - Safe download 


Fail-Safe Setup 


You can make the download more reliable by 
storing the downloaded bitstream into semi- 
permanent memory (flash RAM). The 
FPGA can then be reconfigured from the 
flash memory. See Figure 3. 


An even more secure solution is to work with 
two memories. A basic configuration can be 
loaded into the FPGA when it is shipped 
from the manufacturer. During operation in 
the field, the microcontroller can connect to 
the Internet and download a new configura- 
tion into the second memory. The new con- 
figuration bitstream would be downloaded 
into the FPGA at next boot. 


When the download works, the new config- 
uration will be used. If the new program- 
ming bitstream fails, the microcontroller will 


boot again from original memory. 
Internal Microcontrollers 


Internal microcontrollers for FPGAs come in 
the form of MicroBlaze software and the 
hardware-embedded PowerPC 405 processor 
in Virtex-II PRO devices. (As a point of 
interest, the Virtex-II PRO Platform FPGA 
can be configured with both the MicroBlaze 
software and the PowerPC 405 processor, 
but that is beyond the scope of this article.) 


While there are significant advantages of 
having microcontrollers onboard Virtex 
Platform FPGAs, there are also factors, in 
this design case, that require special con- 


sideration: 
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Master mode 


¢ In both cases of soft and hard microcon- 
trollers, the Internet protocol stack must 


be programmed (ported) onto the FPGA. 


¢ The FPGA must get a basic (possible par- 
tial) bitstream to download _ the 
MicroBlaze controller, its memory, and 


peripherals. 


¢ The PowerPC Virtex-IJ PRO Platform 
FPGA must have memory and peripherals 
downloaded before it’s able to boot. 


¢The small control algorithm that was 
done in the previous description within 
the external microcontrollers must now be 
implemented in a small CoolRunner™ 


CPLD, as shown in Figure 4. 


Figure 4 - 
Internal 
processor 





In either case, the first bitstream must con- 
tain the basic application of the FPGA. This 
way the system can operate as a standalone 
unit without problems. However, during 
operation, the FPGA microcontroller can 
contact the Internet and download new bit- 


streams as they become available. 


The “basic setup” for external microcon- 
trollers described above cannot be applied 
to internal microcontrollers, because the 
downloaded bitstream must be stored by 
the FPGA in some semipermanent (flash) 
memory. The final design of embedded 
microcontrollers can have different levels 
of fail-safe operation, depending on the 


system requirements. 
Conclusion 


It is possible to perform IRL operations in a 
safe and inexpensive way using external or 
internal microcontrollers — and some extra 
hardware. Xilinx IRL technology can now be 
used in almost any design that requires field 
upgradability. Reconfiguration by e-mail 
eliminates the need for complex and expen- 
sive design solutions. Furthermore, with a lit- 
tle ingenuity, the IRL solution can be extend- 
ed to designs that were not previously con- 
sidered for field upgradability. Application 
notes with detailed solutions will be posted 
in the coming months on the Xilinx website: 
www.xilinx.comlapps/appsweb.htm. Look for 
“Virtex FPGA” or “FPGA Configuration.” 
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As a designer, you are constantly asked to 
deliver new and enhanced systems. Your 
marketing department asks you to add new 
features, but do you have the time or prior 
experience to design everything in the new 
specification? If not, you can always look at 
including pre-developed system functions 
in the form of intellectual property (IP) 
cores from the FGPA vendor or one of a 
growing number of third-party providers. 
But can you wait until the legal issues 
around IP licensing are resolved before you 
even begin your design? If only there was a 
way to clear the legal hurdles so you could 
spend your time on what matters most — 


getting your design completed on time. 


In September of 2001, Xilinx launched the 
Common License Consortium to specifi- 
cally address this issue. It is an industry- 
first initiative to simplify the licensing 
process for FPGA-based IP cores. At that 
time, Xilinx and more than 21 third-party 
IP providers all agreed to offer a common 
set of IP licensing terms — called the 
SignOnce IP License — to get access to use 
their cores. Let’s look at how you can ben- 


efit from this new and innovative program. 
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Strength in Numbers 
When selecting IP cores for your FPGA 


designs, you have access to Xilinx 
LogiCORE™ products as well as prod- 
ucts from our third-party network of 
AllianceCORE™ providers. This hybrid 
approach ensures you a broad portfolio of 
IP and expertise to choose from. You ben- 
efit from competition, and it potentially 


puts experts right in your own backyard. 


But everything comes at a price. A multi- 
vendor solution introduces the challenge 
of dealing with multiple providers, each 
with a different set of IP license terms. 
Research has shown that license negotia- 
tions between a customer and a single sup- 
plier can exceed six months. Clearly, situ- 
ations that require you to deal with more 


than one vendor would be intolerable. 


The SignOnce IP License takes care of 
this. With it, you can sign up to a single 


set of license terms that, today, gives you 


iz g SIGNONCE 
: IP LICENSE 


Consortium membership continues to 


access to 500 
IP cores from 
well over 30 


providers. 


grow and includes companies from North 


Consortium to streamline the IP licensing 
Si process and improve your time to market. 
-_- 


America, Europe, Japan, and Southeast 
Asia, so there is likely a solution provider 
somewhere near you. There is no cost for 
membership in the consortium, and only 
IP providers may join. They do not need 
to participate in the AllianceCORE pro- 
gram to become a member, and there is no 
cost to customers to gain the benefits of 


the program. 
What Is Covered? 


The SignOnce IP License leverages the fact 
that licensing IP cores for use in a specific 
FPGA is less complex than licensing for 
use in an ASIC. ASIC use usually requires 
the transfer of source code. This requires 
extensive legal wording that can swell to 
30 pages to protect the vendor against such 


issues as improper use and piracy. 


The SignOnce IP License reduces this page 
count to four by focusing on the transfer of 
an FPGA netlist. Netlists are specific to 
particular FPGAs, and they are difficult to 
reverse engineer or port to a different tech- 
nology. This dramatically simplifies the 
license process. After a SignOnce IP 
License is in place, you will then be able to 
purchase individual cores at a price that 


you and the vendor(s) agree on. 
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Usage Options 
The SignOnce IP License provides project- 


based and site-based usage options. 


A project-based license allows you to use a 
core within a single design. Ultimately, the 
core is incorporated into a larger design, con- 
verted to a bitstream, and pro- 
grammed into the FPGA. Based on 
this, the definition of a “project” 


includes the following scenarios: 


eA single bitstream, which can 
include multiple instances of the 
core. You can then use that bit- 
stream on one or more printed 
circuit boards. In this case, the 
project is defined as the entire 
chip that includes the core, which 
can then be used (without modi- 
fication) in other designs. This is 


similar to the way you might use 


an ASIC or ASSP device. 


eA single printed circuit board, 


using one or more bitstreams, 





each containing one or more 


instances of the core. This allows 


you to leverage the reconfigurable Qaep 


nature of FPGAs in your design. 
Your design group for the “proj- 


ect” can span multiple sites. Usage of the 
IP core that goes outside of the above def- 
initions would require you to negotiate 
additional fees with the IP vendor. 


On the other hand, a site-based license 
allows your company to use the core in 
unlimited designs developed at the speci- 
fied site. You would have to pay additional 
fees for usage at other sites, or you could 
negotiate a list of sites to be covered under 


the up-front license fees. 


In general, site-based licenses cost more than 
project-based licenses from IP vendors that 
support both. Most consortium participants 
support both license types, but some only 
offer one. You should inquire about this 


when working with the vendors. 
Simplifying the Process 


The SignOnce IP License consists of three 
parts: the legal terms contained in the body 
plus two exhibits. 
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The legal terms form the bulk of the license 
and govern issues such as usage (project 
versus site), intellectual property rights, 
indemnity, warranty, export restrictions, 
governmental use, and so on. All legal 


terms (except for usage) are identical for 


both project and site versions of the license. 
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Figure 1 - IP Center Smart Search with SignOnce IP license option 


Exhibit A is designed to list the specific 
cores that you will license. Once you have 
a SignOnce IP License in place with a con- 
sortium member, you can license addition- 
al cores from them by negotiating a sepa- 
rate Exhibit A for each core and paying the 


appropriate license fees. 


Exhibit B is designed for listing the consor- 
tium members that you wish to do business 
with. Each member signs and attaches a 
separate Exhibit B so you can add IP 
providers at will. You can sign all members 
on at once or add ones as needed by simply 
having each sign a separate Exhibit B that 


references the original license terms. 


This structure provides 
considerable flexibility. 
Even if you sign a 
license with one vendor 
for one core, you have 
the ability to later sign 


on additional consor- 


— oe es 


Information 


Program information, consortium 


members, and license forms 


Xilinx IP Center Smart Search Engine 


tium members to purchase more cores. 
Because members have agreed to use the 
same licensing terms, additions take little 
further assistance from your legal depart- 
ment. When dealing with any consortium 
member, make sure you let them know that 
you are interested in licensing IP using the 


SignOnce IP License. 
In Search of SignOnce IP 


All SignOnce IP cores available from 
Xilinx and our AllianceCORE part- 
ners can be found in the Xilinx IP 
Center (www.xilinx.com/ipcenter/). 
As shown in Figure 1, the Smart 
Search engine allows you to restrict 
your searches to only show cores that 
are available under the SignOnce IP 
License. Even if you don’t select this 
option, the search results will indi- 
cate which are SignOnce cores. You 
will need to contact non- 
AllianceCORE members of the con- 
sortium directly to find out what 


cores they offer. 


Conclusion 





= 


Your job as a designer is already dif- 
ficult enough. IP cores are available 
to simplify and accelerate your 
development process, allowing you 
to focus on the portions of the design 
where your expertise adds value. If you 
adopt the SignOnce IP License and deal 
with members of the Common License 
Consortium for your IP, you will remove a 
major time-to-market bottleneck for you 


and your company. 


For more information on the SignOnce pro- 
gram — including the growing list of consor- 
tium members, copies of the licenses, and 
searchable lists of Xilinx and AllianceCORE 
vendor IP — follow the Web links shown in 
Table 1. The solution is available. Let it 


work for you. 


Web URL 


www. xilinx.com/ipcenter/signonce.htm 


www.xilinx.com/search/ipsearch.htm 





Table 1 - Web links for SignOnce program information and products 
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New tools for Xilinx Reed-Solomon LogiCORE 
products help speed development and reduce errors 
in noise-prone multimedia and communications devices. 


by Warren Miller 

VP ot Marketing 

Avnet Design Services 
warren.miller@avnet.com 


Because Reed-Solomon codes are very 
good at correcting burst errors, they are 
used in applications where noise or defects 
can take out a series of information bits. 
Products like CD and DVD players, as well 
as wireless and hard-wired communica- 


tions systems, use Reed-Solomon codes. 


Successful communications in noisy envi- 
ronments using Reed-Solomon codes are 
possible through their means of detecting 
and correcting multiple errors without 
needing to retransmit the data. This 
increases bandwidth and improves data 
integrity in error-inducing communica- 


tions channels. 
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Avnet Design Services has created a hard- 
ware reference design to demonstrate the 
functionality of two Xilinx Logi CORE™ 
Reed-Solomon IP cores in an error-prone 
system. The reference design uses standard 
Avnet-developed evaluation kits with the 
Raptor/ISD (in-system developer) IP envi- 


ronment from Experience First Inc. 
Strength from the Core 


The strength of Reed-Solomon codes in 
handling multiple bit errors makes them 
one of the most efficient and common 
error correction codes used in communica- 
tions applications. Other codes (such as 
Hamming codes) are better suited for the 
more random error patterns found in 
memory systems, where only one bit at a 


time is affected. 





Reed-Solomon codes append a series of 
symbols (check words) to a series of infor- 
mation symbols (data words). The check 
words are constructed (similar to the famil- 
iar parity check) in such a way that if errors 
are introduced anywhere in the data, the 
correct data can be reconstructed. The 
number of errors that can be corrected is 
one half the number of check symbols 
(rounded down). For 8-bit-wide data 
words, a common code has a block size of 
207 words with 20 check symbols and 187 
data symbols. This is the example code we 


will use in the reference design. 


The Avnet Virtex Development Kit 


The Virtex™ Development Kit developed 
by Avnet includes hardware tools on a sin- 


gle board for testing and debugging a refer- 
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Figure I - The Avnet reference design hardware for Reed-Solomon code applications uses a 
Xilinx Spartan-II FPGA Encoder (left) and a Virtex-II Platform FPGA Decoder (right). 


ence design implementing Reed Solomon 
codes on Xilinx Virtex and Virtex-II 
Platform FPGAs. The Avnet board 


includes the following features: 


e Data Generator, which creates an input 


data stream 


e Reed-Solomon Encoder, which takes 
blocks of input data and creates the check 
symbols 


e Error Generator, which injects controlled 
pseudo-random error patterns into the 


communications channel 


¢ Reed-Solomon Decoder, which detects 


and corrects errors in the transmitted data. 


The Data Generator and Encoder reside on 
the Avnet-designed Xilinx Spartan™-I] 
FPGA evaluation board, as shown in 
Figure 1 (left). The data block with 
appended check words is transferred to the 
Virtex evaluation board where the data is 
received and corrected by the Decoder, as 


shown in Figure 1 (right). 


The Error Generator uses a pseudo-random 
number generator to select errors to inject 
into the information block. Adding errors to 
a reference design is necessary, because when 
there are no errors, it is not clear that any- 
thing is actually being done by the design. 
An IP development tool that works with the 
Avnet Virtex development board allows you 
to easily see the results of the decoder’s pro- 


cessing in the reference design. 
Raptor IP Development Environment 


The Raptor IP Development Environment is 
an integrated software and firmware tool that 
works in conjunction with the Avnet Virtex 


Development Kit. The Raptor environment 
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accelerates the development, integration, and 
test of IP or IP-based Xilinx FPGA designs. 
Raptor automatically inserts logic around 
any IP core to route inputs and outputs to 
real-time input signals, output signals, and to 
buffer memory available on the development 
board. This allows the core to be exercised at 


“hardware speeds.” 


Core output signals that are stored in the 
buffer memory can be read over 
the USB port to a host computer 
and displayed on the waveform 
viewer, just like software simula- 
tion results. This hardware speed 
approach to verification reduces 


the design/debug cycle time dra- 





matically, making it easy to 


el ELLE T 
qd 


change the design or test set-up 


and see the results immediately. 


Used in conjunction with hard- 
ware-based input stimuli, verifi- 
cation of very complex and 
robust test suites are orders of 
magnitude faster than approach- 
es based on pure software simula- 


tion. 


The Raptor user interface, shown 
in Figure 2, allows you to specify 
the inputs and outputs to your 
IP The necessary logic is auto- 
matically added around your IP 
and then compiled using the 
Xilinx 


(Foundation™ Series software in 


development tools 


this example). The IP core is then 
exercised by the hardware and the 
results are read out of memory 
and transferred to the PC over 
the USB port. The signals can be 
displayed on a waveform display 
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(a sample output is shown in Figure 3). 
Because the signals are run at hardware 
speeds, hundreds of thousands of cycles can 
be exercised in the same time as a single 


cycle of software-only simulation. 
Conclusion 


Reed-Solomon codes are a basic building 
block of many digital communications 
systems. The Xilinx LogiCORE Encoder 
and Decoder were easily ported to the 
Avnet development boards, using the 
Raptor IP development system from 
Experience First. For more information on 
Avnet 


Development Boards visit 


www.ads.avnet.com or contact Warren 
Miller at 
(warren. miller@avnet.com). For more infor- 


Raptor ISD visit 


Avnet Design Services 


mation on the 


expfirst.com or contact Bob Read at 


Experience First (bread@expfirst.com). 
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Figure 2 - Experience First's Raptor user interface lets you 
specify input and output signals to your IP reference design. 
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Figure 3 - Real-time Decoder waveform signals are 


input through the USB port. 
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FPGAs are critical to the success 
of the digital video revolution. 


by Amit Dhir 

Manager, Strategic Solutions 
Xilinx, Inc. 
amit.dhir@xilinx.com 


In the early 1970s, the only piece of equip- 
ment needed for watching TV was a stan- 
dard television. In the 1980s, this simple 
model began to change. Cable and satellite 
TV providers required consumers to con- 
nect their [Vs to dedicated networks. Also, 
operators decided to scramble TV signals, 
requiring a special box to de-scramble the 
signals at the consumers home. Today, dig- 
ital television requires a set-top box to 


receive and decode digital transmissions. 


The main function of a set-top box is to 
receive additional digital transmissions 
(cable, satellite and/or terrestrial channels) 
and to decode into a form suitable for dis- 
play on analog television sets. It’s a complex 
electronics device comprised of a myriad of 
hardware and software components, usual- 
ly connected to the TV set and the cable 
connection on the wall. They are installed 
and configured by the local cable, terrestri- 


al or satellite service provider. 
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The home audio-video landscape is 
quickly transitioning from analog to dig- 
ital, now providing several hundred chan- 
nels of 24-hour coverage, Internet access, 
and other services. The convergence of 
functions provided by the television and 
the PC requires a platform to provide 
these services. With the arrival of several 
services, the set-top box is growing 
beyond its traditional function of 
enabling digital video. Future generations 
of set-top boxes will provide more services 
such as the ability to pause, record, store 
and replay live video; provide video on 
demand (VoD); provide Internet access; 
and control other consumer devices. 
Providing these services will require the 
relatively simple set-top box to add com- 
ponents such as flash memory, hard-disk 
drives, security chips, home networking 
chipsets, and so on. Programmable logic 
solutions will enable the integration of 
these services and components in future 


generations of set-top boxes. 






The Set-Top Box Today 


Digital TV set-top boxes are sometimes 
called receivers. A set-top box is necessary 
to television viewers who wish to use their 
current analog television sets to receive dig- 
ital broadcasts. Typical functions imple- 


mented in a set-top box include: 
¢ Decoding the incoming digital signal 


¢ Verifying security levels as well as 


content access rights 


¢ Separating the audio and video data 


from the decoded signal 


¢ Decoding the separated audio and 


vatelcomerticy 
¢ Presenting video to the display device 
¢ Presenting audio to the audio outputs 


° Processing and rendering frlesunlameelee 


tent and other interactive services 


¢ Providing electronic program guide and 


remote control features. 


Spring 2002 





| Perspective [ARPS Srlenseccae Boxes 


Figure 1 shows the block diagram of a 


generic current generation set-top box. 


According to Dataquest, sales of digital set- 
top boxes reached 25.1 million units in 
2000, worth an estimated $6.3 billion, and 
will exceed 92 million units and revenues 
of $11 billion by 2005. It is estimated that 
35 million U.S. homes will use digital set- 
top boxes by the end of 2006, the estimat- 
ed year ending the transition to DTV. 


Digital set-top boxes are used for satellite, 
cable, and terrestrial digital TV services. 
They are especially important for terrestrial 
services because they guarantee viewers free 
television broadcasting. A set-top box price 
ranges from $100 for basic features to over 
$1,000 for a more sophisticated box. It is 
often leased as part of signing up for a serv- 
ice. Leading set-top box manufacturers are 
looking to increase revenues by winning con- 
sumers that would like to use the TV set not 
only to watch television programs but also to 
browse websites, send e-mail, and shop for 


goods and services through the Internet. 


Set-top boxes in the future will provide 
more channels and increased choices 
through specialist channels, which can pro- 
vide immediate feedback to broadcasters. 
The set-top box will provide new services 
such as improved pay-as-you view, online 
shopping, 


demand, and hard-drive storage. 


interactive IV, video-on- 


Providing Digital Video Processing 


In the digital TV realm, a typical digital 
set-top box contains one or more micro- 
processors for running the operating sys- 
tem (possibly Linux or Windows), and for 
parsing the MPEG transport stream. A set- 
top box also includes RAM, an MPEG 
decoder chip, and more chips for audio 


decoding and processing. 


The contents of a set-top box depend on 
the digital TV standard used. European 
DVB-compliant set-top boxes contain 
parts to decode COFDM transmissions 
while ATSC-compliant set-top boxes con- 
tain parts to decode VSB transmissions. 
The set-top box provides improved quality, 
support for HDTV transmission, prevents 


“ghosting or interference effects, and allows 
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broadcasters to provide flexibility in terms 


of bandwidth vs. quality. 
The MPEG-2 encoder, at the heart of every 


set-top box, is composed of a number of 


discrete algorithmic sections: 


e Temporal processing — It seeks out and 
removes temporal redundancy. This 
involves storing several successive 

images and performing motion estima- 

tion, compensation, and simple algo- 
rithmic processing to derive a pixel-by- 


pixel difference signal. 


e Spatial Processing — It uses DCT (Discrete 
Cosine Transform) to remove the high fre- 


quencies not discernable by the human eye. 


Statistical or variable length encoding 
(VLC) is used to remove redundancy in the 
output from the DCT. The MPEG-2 algo- 
rithm makes use of the DCT/IDCT algo- 
rithm. DCT returns the discrete cosine 
transform of 


“video/audio 


Figure 1- Block diagram of 


current generation set-top box 


input” (referred to as the even part of the 
Fourier series) and converts an image or 
audio block into its equivalent frequency 
coefficients. IDCT (or inverse DCT) recon- 
structs a sequence from its DCT coeffi- 
cients. Compression allows increased 
throughput through the transmission medi- 
um. Video and audio compression makes 


multimedia systems very efficient by increas- 


ing CPU bandwidth, providing higher video 
frame rates and better audio quality, and 


enabling multimedia interactivity. 


While MPEG-2 is the most common com- 
pression scheme used in set-top boxes, 
MPEG-4 compression technology is gath- 
ering acceptance. It is earning keen interest 
from set-top-box vendors and semiconduc- 
tor companies hungry to add features (such 
as picture-in-picture) to current designs, 
and from service providers eyeing it for 
home networking and for set-tops integrat- 


ed with digital video recorders (DVRs). 


The MPEG-4 initiative is led by satellite 
providers exploring ways to reduce the bit 
rates of their broadcast streams and by cable 
operators looking to add object-based inter- 
active features to their programming. Using 
the MPEG-4 Advanced Coding Efficiency 
profile, satellite broadcast streams — current- 


ly delivered at bit rates of 2.5 to 3 Mbps — 





can be trimmed to 1.6 Mbps while main- 
taining good-quality video. Meanwhile, 
some service operators are also considering 
MPEG-4 as a way of saving hard-disk-drive 
space in set-top boxes equipped with DVRs. 


Others see MPEG-4 as effective in reduc- 
ing the bandwidth required for real-time 


video for home-networking applications. 
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There is a need for MPEG-4 in picture-in- 
picture applications — while a national 
game broadcast occupies the screen, for 
example, a local game compressed in 
MPEG-4 could be broadcast simultaneous- 
ly so that it appears as a picture within the 
picture. Further, Internet-streaming con- 
tent — compressed in MPEG-4 or 
Windows Media — could also be displayed 


on screen ina picture-in-picture format. 


Most set-top box manufacturers are not 
looking to replace MPEG-2 with MPEG- 
4, but to equip a set-top box to transcode 
MPEG-2 streams into MPEG-4. Clearly, 
the versatility of the MPEG-4 standard is 
playing to its advantage as MPEG-4 finds 


its way into different set-top uses. 
Integrating DVR, VoD, and NetTV Functions 


The last few years has seen the introduction 
of the digital video recorder (DVR), also 
known as digital VCR and personal video 
recorder (PVR). The DVR uses local storage 
(such as a hard-disk drive) to enable the user- 
controlled storage and playback of live digi- 
tal video streams. The functionality includes 
the ability to simultaneously record and play- 
back separate video streams or different por- 


tions of the same stream in real-time. 


With a built-in modem, the DVR device 
dials a service provider and downloads the 
programming guide and other software 
updates. While high unit growth is being 
predicted for these DVRs, most set-top box 
manufacturers are looking to incorporate a 
hard-disk drive and DVR functionality with- 
in the set-top box. This will provide the capa- 
bility to store real-time TV broadcasts in 
high digital quality, instant access to the 
recorded data, proactive and quick TV man- 
agement, and the simultaneous use of multi- 
ple data streams. It will also allow the ability 
to download software and other applications 


provided by a digital TV service provider. 


Apart from providing real-time TV broad- 
cast recording capability, set-top boxes will 
provide the consumer with video-on- 
demand capability. The consumer will be 
able to order a movie instantly. It provides 
the ability to pause, fast forward, and rewind 


the movies as often as the consumer desires. 
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The NetI’V is a T'V-centric consumer appli- 
ance that provides Internet access while using 
the T'V as the primary display. In its most basic 
offering a NetT'V provides limited interactive 
electronic programming guides or customized 
information tickers. Advanced NetI'Vs pro- 
vide full graphical Web browsing and video 
email. While the consumer is very interested in 
services such as access to the Internet, cost 
remains a big inhibitor for the success of the 
NetTV appliance. Set-top box manufacturers 
are looking to integrate the functionality of the 
Net T'V in next generation set-top boxes. In the 
Internet realm, a set-top box is really a special- 
ized computer that can “talk to” the Internet — 
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Figure 2 - Digital set-top box 


that is, it contains a Web browser (which is 
really a Hypertext Transfer Protocol (HTTP) 
client) and the Internet's main program, 
TCP/IP. The service to which the set-top box 
is attached may be through a telephone line or 
through a cable TV company. 


Enabling Broadband Access 
and Home Networking 


The arrival of digital media such as data, 
voice, video and communications (Internet) 
through appliances such as digital cameras, 
MP3 players, cellular phones, and Web 
pads, is bringing a need for networking con- 
sumer devices and PCs. All these consumer 


devices are demanding high-speed access to 
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the Internet. A house however will have a 
single broadband access point and the digi- 
tal media will be shared not only between 
friends and family across the Internet but 


also between appliances in the house. 


The set-top box of the future will provide 
high-speed Internet access from satellite, 
cable, DSL, fixed wireless, and terrestrial 
access technologies. While one box may 
support a single access technology, combina- 
tions of broadband access technologies will 
exist within one set-top box. Set-top boxes 
will also provide interconnectivity between 


consumer devices through a number of 
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home networking technologies, such as: 
¢ No new wires — Phonelines, powerlines 


e New wires — IEEE 1394, IEEE 1355, 
Ethernet, USB 1.1/2.0, Optic Fiber, RS- 
232, IEEE 1284 Parallel Port 


e Wireless — HomeRE, Infrared, Bluetooth, 
DECT, IEEE 802.11b, IEEE 802.11a, 
HiperLAN2 


As home networking gains popularity, 
the set-top box will grow from enabling 
digital TV and broadband access in the 
house, to a residential gateway that man- 
ages and controls other information 


appliances in the home. 
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Categories and Evolution to 
Home/Residential Gateway 


This huge installed base of set-top boxes 
can be broadly classified into the following 


categories: 


e Analog set-top boxes perform the func- 
tion of receiving, tuning, and de-scram- 
bling incoming television signals. These 
receivers have changed very little over the 


past twenty years. 


¢ Dial-up set-top boxes allow subscribers to 
access the Internet from the comfort of 


their living room through the television. 


¢ Entry-level digital set-top boxes are capable 


of receiving broad- 


advanced services such as video telecon- 
ferencing, home networking, IP telepho- 
ny, VoD, and high-speed Internet TV 
services. Additionally, it has enhanced 
graphical capabilities to receive high defi- 
nition T'V signals and can store video on a 
hard disk drive, while providing the capa- 
bility to record and view video simultane- 
ously. Such receivers have a range of high- 
speed interface ports, and resemble resi- 
dential gateways. For cable, terrestrial, and 
satellite companies, set-top boxes that 
support advanced technologies are an 
opportunity to increase revenue streams 


through providing services. 


box for receiving television and a modem 
to connect to the Internet. Phase two 
includes advanced features such as broad- 
band connectivity, home networking inter- 
faces, and IP telephony in the residential 
gateway. The third phase will be deploy- 
ment of powerful gateways, capable of 
delivering video, voice and data through- 
out the home and providing services such 
as home automation, energy management, 


security control, and so on. 


Supporting multiple technologies makes 
the gateway less likely to become obsolete. 
Support for modularity will fuel the evo- 
lution of gateways into a type of applica- 

tion server that 


_ .. consumers will 
cast digital television oe 
NTSC use to distribute 
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box include limited 


quantities of memo- 





ry, interface ports 
and processing 
power. Figure 2 
shows a digital set- 


top box. 


Mid-range digital set-top boxes include a 
return (back) channel, which provides 
communication with a server located. 
These set-top boxes provide e-commerce, 
Internet browsing, and multimedia serv- 
ices. The return channel further allows 
for customized broadcasts to the local 
viewing population. These types of boxes 
have higher processing power and storage 


capabilities of entry-level boxes. 


Advanced digital set-top boxes are like 
multimedia desktop computers, contain- 
ing much higher processing power than 
other set-top boxes. Enhanced capabilities 
in conjunction with a high-speed return 


path can be used to access a variety of 
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Figure 3 - Residential gateway 
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The residential gateway is a platform to 
bring broadband into the house and con- 
nect or bridge to home networking tech- 
nologies. It enables communication 
between networked appliances in the home 
and across the Internet. The evolution of 
new data broadcasting services has created 
the need for a device to pass digital content 
between the Internet and the home net- 
work. Future gateways will provide integrat- 
ed services such as remote management, 


home automation, and home security. 


Mass deployment of gateways will come in 
three distinct phases. While the gateway 1s 
a new term, it already exists in many 


homes. Most of our homes have a set-top 


Controller 









DACs 


10/100 Base-TX 


gateway must 
have a reliable 
HomePNA and robust hard- 


ware platform, 
and software that 
is not susceptible 
to errors. Unlike 


PC users, 


sumers will not 


Transceiver con- 


USB 


é UTP 
Transceiver 


stand rebooting 


their gateways. 


IEEE 1394/Firewire : 
Supporting mul- 


RS-232 services 


tiple 
with complete 

security is essen- 
tial. Functions such as e-commerce trans- 
actions, and remote home control and 
access from authorized service providers 
are critical. Providing quality of service to 
support multiple intelligent devices from 


different vendors is extremely important. 


Figure 3 shows the block diagram of a resi- 
dential gateway. The gateway provides a 
unified platform to satisfy the needs of most 
consumers, providing infotainment, con- 
veniences, and communication. The success 
of the set-top box, and hesitation by a large 
population to own a PC, allows for the set- 
top box to grow into a successful residential 
gateway. Set-top boxes will evolve into mul- 


timedia servers, forming the hub of the 
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home network for primary access to the 


Internet, such as the residential gateway. 


Programmable Logic Solutions Enable Future 
Set-Top Boxes 


For residential gateways to be successful, 
programmable solutions will have to be at 
the heart of the system. Programmable logic 
devices address the fundamental disconnect 
between ASSPs and provide the interface 
and protocol translation between different 
broadband and home-networking chipsets. 
They provide significant time-to-market 
advantages and the ability to upgrade quick- 
ly, which is imperative for a successful prod- 
uct. This gives you the cutting edge to bring 
products first to market, while having the 
ability to remotely add features to the set- 
top box already deployed in the home. 


The set-top box will combine components 
such as digital modem chipsets, home net- 
working chipsets, processors, memory, and 
software. Digital modem chipsets provide 
connectivity to different broadband net- 
works, and home networking chipsets pro- 
vide interconnectivity technologies between 
appliances. Other ASSPs/ASICs handle and 
process digital video and audio services. 
These ASSPs communicate with each other 
via buses on the system board. The proces- 
sor is responsible for coordinating the differ- 
ent components. With additional features, 
set-top boxes will require higher perform- 
ance processors to keep pace with increased 
data throughput. However, all these stan- 
dards and ASSPs promote differing inter- 
faces making glueless connectivity impossi- 
ble between the different ASSPs, memories, 


and processors. 


There are three types of set-top box soft- 
wate: operating systems, middleware, and 
applications. The operating system operates 
the set-top box parts. The middleware is a 
layer of software programs that operates 
between the interactive TV applications and 
the operating system. Viewers use applica- 
tion software to watch TV and use interac- 
tive features. The middleware also enables 
the smooth interoperation of information 
appliances and services within the home, to 
eliminate the complexity, distribution, and 


technical disparity of the system elements. 
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For video processing conventional DSPs 
(digital signal processors) provide a fixed 
data width and inflexible architecture. They 
typically have 1-4 MAC units and require 
serial processing, which limits data through- 
put. This causes a need for high clock fre- 
quency DSPs, which creates system chal- 
lenges. Hence, multiple DSPs are needed to 
meet bandwidth requirements, thus causing 


board 


Programmable logic devices have a flexible 


power and space issues. 
architecture with distributed DSP resources 
and embedded multipliers. These devices 
can support any level of parallelism or serial 
processing through an optimal 
performance/cost tradeoff. 
This parallel processing 
maximizes the data 
throughput. 
Hence, program- 

mable logic solu- 

tions exceed DSP 
requirements of 

the video market 

— providing both 
flexibility and_per- 
formance. FPGAs are 


off-the-shelf 


which provide fast time-to- 


devices, 


market, rapid adoption of stan- 

dards, optimal bit widths, and real-time pro- 
totyping along with support for high data 
rates. With a whole suite of DSP algo- 
rithms/cores and tools created for program- 
mable logic devices, development time can 
be reduced by weeks while increasing the 
performance of the system. For example, the 
implementation of the DCT/IDCT core in 
a programmable logic device can off-load 
the system processor performing MPEG 
encoding/decoding with a 50X to 200X per- 


formance gain. 


Programmable solutions also provide the 
ability to interface to different hard-disk 
drive types as well as NAND and NOR flash 
memory types (depending on availability). 
They also provide content protection capa- 
bilities using DES, triple DES, AES, and 
even proprietary encryption schemes. They 
also provide system interface functions such 
as PCI, USB, and so on, in the set-top box 


and residential gateway. In addition, the 





presence of the FPGA in the system provides 
the capability to remotely upgrade features 
through the Internet when the box has been 
shipped to the customer, hence providing a 


significant cost savings. 
Summary 


The set-top box is driving the digital revolu- 
tion right into your living room. Your finger- 
tips now command a wealth of high-quality 
digital information and digital entertain- 
ment, right from your favorite armchair. The 
set-top box revolutionizes home entertain- 
ment by providing vibrant television images 
with crystal clear sound, along with 

e-mail, Web surfing, and cus- 

tomized information such 
as stock quotes, weather 
and traffic updates, 
shopping, 


and video-on- 


on-line 


demand — right 
through a tradition- 


al television. 


The set-top box will 
evolve into home mul- 
timedia centers, possibly 
forming the hub of the 
home network system and the 

primary access for consumers to connect 

to the Internet. There will be a convergence 
of technology with equipment connected to 
the television, such as adding hard drives for 
television program storage and instant replay. 
As the set-top box evolves, it will also provide 
home networking capabilities and value- 
added services, while becoming the residen- 


tial gateway of the future. 


Programmable logic solutions are necessary 
for the success of set-top boxes and resi- 
dential gateways as they provide time-to- 
market and time-in-market advantages in 
interfacing disparate technologies, compo- 
nents and system interfaces. They also pro- 
vide a significant performance advantage 
over digital video processors. What is clear 
is that the set-top box market is growing at 
a very dramatic rate, and when markets are 
so dynamic and the future is very unpre- 
dictable, decreased time-to-market and the 
ability to upgrade very quickly is impera- 


tive for a successful product. 
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EEE 1394 Hardware 
Reference Design 


Na reference design provide OEMs with 
a complete low-cost solution tor integra ting 


digital video | in consumer products. = 
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Xilinx and Convergent Designs recently 


announced the availability of Centauri, a 
low-cost digital video (DV) IEEE 1394 
hardware reference design based on the low- 
cost Xilinx Spartan™ FPGA and the Divio 
NW701 DV codec. The new reference 
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“The Spartan family of FPGAs provided 
an ideal low-cost solution,” said Michael 
Schell, president of Convergent Design. 
“Because of the Xilinx commitment to 
the Digital Video market, we're able to 


rapidly introduce cost effective consumer 


“THE SPARTAN FAMILY OF FPGAS PROVIDED AN IDEAL LOW-COST SOLUTION. BECAUSE OF THE XILINX COMMITMENT 
TO THE DIGITAL VIDEO MARKET, WE RE ABLE TO RAPIDLY INTRODUCE COST EFFECTIVE CONSUMER SOLUTIONS THAT 


REDUCE OVERALL TIME-TO-MARKET AND COST. 


design provides OEM manufacturers with a 
complete low-cost solution for easily inte- 
grating DV over IEEE 1394 into a variety 
of consumer digital video products such as 
DVD R/Ws, digital VHS recorders, set-top 
boxes, residential gateways, Personal Video 
Recorders (PVRs), Digital TVs (DTVs), 
video projectors, video editors, and profes- 


sional digital audio and video equipment. 


Spring 2002 


— MICHAEL SCHELL, PRESIDENT OF CONVERGENT DESIGN. 


solutions that reduce overall time-to- 


market and cost.” 


“The introduction of this hardware refer- 
ence design is a great example of the com- 
pelling value that FPGAs offer the con- 
sumer marketplace,” said Robert Bielby, 
senior director of Strategic Solutions 


Marketing at Xilinx. “Working closely 






Reference Design 


with Convergent Designs and _ utilizing 
their digital video and audio design expert- 
ise allows us to provide some exciting solu- 


tions for the consumer market.” 
Pricing and Availability 


Centauri is priced at $450 and immediate- 
ly available for purchase on the Xilinx eSP 


website (www.xilinx.com/esp). 
Xilinx eSP for Digital Video 


Xilinx eSP (www.xilinx.com/esp) is the 
industry's first Web portal dedicated to 
accelerating the design and development 
of consumer products based upon emerg- 


Recently 


ing standards and protocols. 
updated to contain broad coverage of dig- 
ital video technologies, the eSP Web portal 
is a comprehensive resource delivering a 
powerful array of solutions and informa- 


tion in a single location. 


About Convergent Design 


Convergent Design has joined the 
Xilinx XPERTS program, a world- 
wide third-party Certified Design 
Service Center trained to take full 
advantage of the features in Xilinx 


FPGAs, software, and IP cores. 


Convergent Design LLC specializes 
in the design and development 
of analog digital video/audio 
conversion and processing products. 
Specific technologies include: PCI, 
1394, DV/MPEG2 compression, 
component video, SDI digital video, 
balanced audio, and AES/EBU 
digital audio. Convergent Design 
offers comprehensive design services 
including initial product specification, 
schematic capture, FPGA code 
development, PCB prototype and 
debug, compatibility testing, and 


release to contract manufacturer. 
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Programmable Logic Enables 
Digital Displays re 


Xilinx, Inc. 
mike.nelson@xilinx.com 


An overview of an important emerging market. 


DisplaySearch Inc. forecasts that by 2005, 
digital displays will eclipse conventional 
display technologies in market revenue. 
Digital liquid crystal displays (LCDs), dig- 


Sanit o ital plasma display panels (PDPs), digital 
ell fit yt) eee — — light processors (DLPs), and many others 
a : 






are fast becoming the display technologies 
of choice. 


This article will explain why this is hap- 
pening and document the universe of com- 
plexity that has been spawned in the 
process. Next we will examine some of the 
unique challenges that digital display sys- 
tems pose for you. Finally, we will review a 
representative case study to illustrate how 
programmable logic can be used to your 
advantage in developing digital display 
products. 


The Digitization of Display Technologies 


What's driving the digital display transi- 
tion? There are three basic answers to this 


question: 

¢ Content has become digital 

¢ Digital displays achieve superior quality 
¢ Digital display technologies have enabled 


new and desirable form factors. 


Content is king. Analog television became 
inexpensive because it served a huge con- 
tent market: broadcast television. But 
things have changed. The emergence of 
commodity PCs, the Internet, digital cable 
and satellite TV, and consumer DVD have 
combined to establish a huge new universe 
of digital content. And, as the transition to 
digital broadcast television unfolds (SDTV 
and HDTV), virtually the entire display 


universe will have become natively digital. 
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With regard to quality, digital content has 


several advantages. 


¢ Digital content can more effectively filter 
noise as illustrated by the lack of back- 
ground hiss on an audio CD as compared 


to an analog cassette. 


¢ Digital content enables digital manipula- 
tion of the content which means that you 
can effect highly specific enhancements. 
For example: shadow enhancement, 
sharpness control, and color manipula- 


tions commonly used in medical imaging. 


e Digital content can be quickly and 
repeatedly replicated with precision and 
widely distributed much more efficiently 


than analog formats. 


And then there is form factor. Without dig- 
ital displays, products such as today’s lap- 
top computer would simply not be possible 
(remember the first Compaq CRT based 
portable PCs?). The same applies to wall 
mounted televisions, PDAs, seat back dis- 
plays on airplanes, and the mini-displays 
on the back of digital cameras. The inher- 
ent flat panel nature of most digital display 
technologies is thus a key factor in their 
increasing success. These applications 
could not be fulfilled without them. 


The Opening of Pandora’s Box 


While the emergence of digital display tech- 
nologies has been a good thing, the develop- 
ments behind them have been the equiva- 
lent of opening Pandora's Box. In the frantic 
rush to digitization, technologies have been 
invented and re-invented on a multi-dimen- 
sional scale across diverse geographies, 
industries, and suppliers. The result is a mul- 
titude of standards, formats, regulations, 
specifications, and derivatives, all of which 


makes your job ever more challenging. 


To illustrate the complexity that we have 
wrought, I have compiled a representative 
list of technologies, formats, and standards 
introduced as part of the mass digital 
emergence, in Table 1. 


Difficult Questions 


While by no means complete, Table 1 nev- 
ertheless illustrates a foremost challenge 


facing you today: risk and complexity. 
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Which are the correct features to support? 
How do they vary by market segment? 
How do they vary by geography? How are 
they evolving? How do you implement the 
mix of features you need quickly and effi- 
ciently? Can you effectively support a fam- 
ily of configurations to service multiple 
market segments? How do you get to mar- 


ket fast enough to gain market share? 


Digital convergence is driving these tech- 
nologies together, in new ways and in new 
products — and making that happen is your 
responsibility. 


Unique Challenges of Digital Display Design 


Beyond the complexity of their world, 
environment digital displays also present 
some unique challenges to you. These 
include achieving the performance 
required for the target application, cor- 
recting for technology-specific display 
characteristics, and generating the drive 


signals for the target display. 


Performance is particularly challenging in 
digital video display applications due to the 


tremendous computational loads involved. 


Wired Broadband 
Connection Access 
Technologies — Technologies 


Chip to Chip 
Technologies 


LVTTL 
IEEE 1394 LVCMOS 
USB 2.0 
Ethernet 
HomePNA HSTL-I 

HSTL-III 
Wireless Backplane WR 
Connection — Technologies SSTL-I 

Technologies Sy PCI SSTLI 
802.11b 3.3V PCI SSTL2-I 
802.110 3.3V PCI-X SSTL2-II 
HiperLAN2 GTL SSTL3-I 
Bluetooth GTL+ SSTL3-II 
HomeRF AGP CIT 


Chip to Memory 
Technologies 


HomePlug 


LVDS Interconnect 
Technologies 


PC Display 


¢ You require a high-bandwidth connec- 


tion from the source. 


¢ You need to perform a complex string of 
operations on the data. These can include 
decryption from a secure transmission for- 
mat, decoding into pixel maps, and opti- 


mization of these pixel maps for display. 


¢ You must use the resulting data to gener- 


ate the driver signals for the display. 


e Finally the display driver generates a fam- 
ily of signals that will be distributed to 
drive each individual pixel. The format of 
these signals varies with each technology, 
they have exacting timing requirements, 
and their specifications are often unique 


to each and every model. 


To make things worse this must all be 
achieved in real time, and while transiting 
this pipeline the dataflow will expand from 
about 25 Mbps (streaming HDTV) to 1.5 
Gbps (raw uncompressed 10801 HDTV 
display data). 


The optimization stage of the pipeline is a 


challenge in and of itself. Here the digital 


Streaming Media 
Formats 


MPEG-1 


Television 
Standards 


LVDS NTSC 
LVPECL PAL MPEG-2 
BLVDS SECAM MPEG-4 
SDIV 480i MJPEG 
SDTV 480p 
HDTV 720p 
HDTV 1080i 


Formats Real Networks 


QuickTime 
MS Media Player 


Digital Imaging 
File Formats 


Encryption 
Standards 


TIFF 

JPEG 
Scitex 
Targa 

GIF 





Table 1 - Standards and technologies spawned in the mass digital emergence 
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image data must be adapted to the specific 
characteristics of the target display technol- 
ogy. This is necessary because while all dis- 
plays operate on similar principles of color 
science, each has its own specific (and non- 
linear) behavioral characteristics. Thus, 
RGB data (which is most typically targeted 
for a CRT display) must be processed to 
display with acceptable results on an LCD, 
PDP, or other display technology. This pro- 
cessing can be as simple as color correction, 
or much more involved with algorithms 
applied for scaling, contrast, brightness, 
gradation smoothing, edge sharpness, 
shadow enhancement, and so on. Almost 
anything is possible, it simply takes ade- 


quate processing power. 
A Case Study 


To illustrate the challenges of digital dis- 
play design let’s analyze a case study exam- 
ple for a digital projector. In the generic 
case such products traditionally accept ana- 
log RGB video input, perform some mod- 
erate processing on the data, and then drive 
the projection display. This is typically 
effected through a variety of analog (blue), 
digital (black), and control (green) compo- 
nents as illustrated in Figure 1. 


With the advent of digital convergence, the 
next iteration of such a product may well 
be required to support some form, or 
forms, of direct digital input and include 
the ability to accept and display encoded 
file formats. In such a case, you face the 
dilemmas regarding which inputs and for- 
mats to support, and then you must select 
and integrate appropriate components to 
realize them in a design that meets per- 


formance requirements. 


Figure 2 illustrates an example for such a 
design that would support a fast serial USB 
2.0 connection as well as an 802.11 wire- 


less LAN connection. 


The simplicity of these illustrations belies 
the complexity of the task. How do you 
implement the new logic in the system 
controller to manage the new data flows? 
What interfaces and signaling standards are 
required in order to integrate the new com- 


ponents? What extensions to your user 
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interface and control software need to be 
developed? And, if it is determined that 
you need to implement and support a vari- 
ety of these technologies and options, your 


task becomes much more complex. 





RGB = 
Digital 

Video —> C i" a : violas 

Input pibhainaly RGB 


The Value of Programmable Logic 


Programmable logic is an ideal solution for 
addressing these challenges. By their 
nature these devices are flexible — the pre- 


mium requirement for success in this 





Figure I - Generic projection display system design 
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Figure 2 - Digital convergence projection display system design 
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Figure 3 - Programmable logic-based digital convergence 
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endeavor. In addition 
FPGAs are fast and effi- 


cient development plat- 


Figure 4 - FPGA 


utility in component 


forms, enabling rapid 
integration 
development cycles. 
Finally, modern FPGAs 
are extremely cost effec- 
tive, and_ therefore 
viable production solu- 
tions for almost any 


application. 


Figure 3 illustrates how 
an FPGA-based solu- 
tion could be used to 
affect our digital con- 
vergence projector. As 
you can see this design 
inserts an FPGA (illus- 
trated in red) and asso- 
ciated logic between the 
A/D 
existing system controller. In this design 


the FPGA serves as the I/O arbiter, accept- 


converter and 


ing input from all three sources. 


In operation the legacy digital RGB signal 
is simply passed through when this con- 
nection is active. In the case of USB 2.0 
and 802.11 however, the FPGA serves to 
manage these new interfaces completely, as 
well as decode the incoming data stream 
into the legacy digital RGB format. 
Decoding can be accomplished entirely in 
the FPGA or with the assistance of an 
ASIC or ASSP as appropriate (illustrated 
by the combined black/red block). 


This approach has several important 


advantages. 


¢ It retains the existing backend of the legacy 
design essentially unchanged. This bounds 


development complexity and reduces risk. 


¢The programmable bridge imposes no 
schedule penalties for numerous iterations. 
This can be a significant advantage when 
you are tasked with integrating new and 


unfamiliar technologies. 


¢Upon completion the design can be 


released and in production very quickly. 


How is all this possible? Figure 4 illus- 
trates some of the standard features and 


IP available in Xilinx FPGAs that make a 
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Chip fa , 


project such as our example 


relatively straightforward. 


On the perimeter of Figure 4 
is System I/O, which allows 
each and every I/O pin to be 
programmed to support any 
of 17 different signaling stan- 
dards. But System I/O doesn’t 
stop there. In addition to the 
basic signaling parameters it 
supports programmable drive 
strength and multiple slew 
rates too. These features make 
it easy to deal with unantici- 
pated PCB behaviors (in 
those not so rare cases where fabrication 
reality doesnt match design theory) as 
illustrated in Figure 5. 


Some FPGAs, such as the new Spartan™- 
IIE family from Xilinx, even go a step fur- 
ther including support for LVDS, BLVDS, 
and LVPECL differential signaling stan- 
dards at up to 400 Mbps per pin pair. This 
enables very high-performance component 
interconnection without the need to resort 
to higher pin count and more expensive 
packaging. Further, it reduces system 
power, lowers EMI, and is much less sensi- 


tive to noise as illustrated in Figure 6. 
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Figure 6 - Noise immunity benefit of LVDS signaling 
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Figure 4 also shows a representative sam- 
ple of the standard controller IP modules 
available for FPGAs. These are commer- 
cial quality function blocks that are avail- 
able to jump-start your design. Much can 
be accomplished with standard IP as solu- 
tions are available to address most of the 
topics listed in Table 1. 


For buffers and FIFOs you have a variety 
of on-chip memory resources to choose 
from. These include 200 MHz flip-flops, 
true dual ported Block RAM, a Shift 
Register Mode (SRL16) capability in the 
FPGA'’s fabric Look-up Table (LUT) struc- 
tures, and highly configurable Distributed 
RAM. These features provide high- 
performance and silicon-efficient solutions 


for almost any on-chip memory need. 


For clock management Xilinx FPGAs fea- 
ture four or more Delay-Locked Loops 
(DLL) per device. These provide the 
resources to synchronize and connect your 
system elements together and manage 
EMI. These DLLs exhibit superior noise 
immunity compared to PLLs, and they 
feature excellent jitter specifications, mak- 
ing your job easier. A few examples of 
their utility are illustrated in Figure 7. 


Finally, an array of Configurable Logic 
Blocks (CLB) and internal interconnect 
resources tie everything together. These are 
illustrated in Figure 8 and are the underly- 
ing fabric that make an FPGA an FPGA. 


The FPGA Way 


While the benefits of programmable logic 
are obvious as illustrated in our example, 
its value can be even greater when lever- 
aged as a foundation element in your 
design. Figure 9 illustrates how our digital 
convergence projector might look if it 


were designed from scratch, only this time 


the FPGA way. 


In this design the flexibility of FPGAs is 
being leveraged to maximum advantage. 
By designing the core logic of the system 
controller into an FPGA component you 
gain maximum flexibility in the selection 


of every other component you require — 


be they HSTL, SSTL, LVTTL, LVDS, or 


58 Xcell Journal 


whatever, they can be quickly and effi- 


ciently integrated. 


Another advantage in our example is the 
modular architecture for system input. In 
this design we could support a family of 
configurations to address a variety of geo- 
graphic and application requirements. 
Further, there is no reason why these can- 
not be developed in a serial fashion, 
enabling the most important configura- 
tions to get to market first. And, all that 
would change from configuration to con- 
figuration in the core design is the bit- 


stream program in the FPGA. 
With programmable logic in the data path 


you have tremendous ability to tailor 
encoding/decoding, encryption/decryp- 
tion, and image processing functionality 
to your precise needs. More importantly, 
you also have the flexibility to keep up 
with changes as these needs evolve over 
time. Take file decryption for streaming 
media as an example. Today there are no 
firmly established standards, and the stan- 
dards that do exist vary widely by geogra- 
phy and content provider. And remember, 
Content is king, and that using a pro- 


grammable device as the decryption 
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engine could allow you to support what- 
ever your customers require both today 


and tomorrow. 


When used as the heart of the display 
driver circuit, programmable logic can 
give your design the ability to support two 
or more display options. This can be of 
tremendous value in managing the cost 
for this high dollar bill of materials com- 
ponent (more than paying for the FPGA 
in many cases) or to support a family of 
products based upon a common core 
design that increases your accessible mar- 
ket. In addition, you can use LVDS to 
route these signals around the board 
(which quite often involves traversing 
large tracts of real estate) and thus mini- 
mize system level EMI and the impact of 


noise on these critical signals. 


FPGAs are also well suited for crafting a 
unique and attractive user interface for 
your design. They are the very definition 
of GPIO (General Purpose I/O) and can 
implement microcontrollers (or even a 
PowerPC microprocessor) for supervisory 
control. In today’s competitive markets 
the user interface can be one of the most 


effective ways to differentiate your prod- 


EMI Savings 






ULI. 
= [wel 


JUL 


Clock Multiplication & Division 
For Use Internally or Externally 
Benefit - Slower Clock on PCB 
Duty Cycle Correction 





Speedup Tc20 
Zero-Delay Internal Clock Buffer 
Benefit - Allows Use of Cheaper Memory 





Clock Phase Synthesis 
Internally or Externally 
Benefit - Faster State Machines, Double-Edged 
Clocking for DDR/QDR Memories 


JUL 
JUL 
JUL 


Clock Mirror Zero-Delay Board Clock Buffer 
Benefit - Reduce Board Delay, 
Reconstruct Nooisy Backplane Clock 


Figure 7 - Some examples of DLL capabilities 
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uct from that of your competition, and an 
FPGA gives you the maximum freedom 


to innovate. 


Finally, your FPGA based solution is 
never frozen. If a customer comes to you 
requesting a new feature, a slightly difter- 
ent capability, or a new con- 
figuration, you have a plat- 
form to quickly and effi- 
ciently deliver it. 
inevitable bugs and incom- 
patibilities crop up you can 
not only fix them quickly, 
you can also update deployed 
systems in the field. This can 
greatly reduce support and 
service costs. And if you ever 
face a supply problem for a 
system component while in 
production, you have the 
flexibility to find and sup- 
port an alternate solution to 
keep your factory running, 
your product shipping, and 


revenue coming in. 
Conclusion 


The era of digital convergence is upon us. 
From pictures to e-mail, from music to 
news, the world has gone digital. And 
because of this digital explosion today’s 


lOB 


systems require ever more connectivity and 
intelligence. It is no longer good enough to 
have the best widget or display. Now you 
need a more digitally connected and data 
manipulating widget or display, one that 


supports the standards and formats in your 


0B 10B IOB 10B |OB Sajal 


ICU 


| 
0) 
Bo 
we 
om 
2 
i | 
O 
‘BI 
IN 
@) 
B 
| 
10) 
3) 
| 
(0) 
B 





(0): un ©) = am (©) > am (©) 5 L@) = 


Figure 8 - FPGA CLBs and interconnect resources 


target market and geography — and one 


that does it before your competition. 


Programmable logic is an invaluable asset in 
confronting this challenge. Its inherent flexi- 
bility makes it an ideal mechanism for graft- 


ing new functionality into an existing design. 


Image 


Decoding Processing 


Programmable 


CLB Tiles 
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interconnect 


Its rich features, efficient development flow, 
and extensive IP support will simplify your 
job and give you a chance to meet aggressive 
schedules. The newest generations of devices 
are cost efficient solutions for almost any 
design. And when made a fundamental part 
of your architecture from 
the beginning, this technol- 
ogy can be exploited to 

0 modularize your configura- 
NX tions, provide flexibility 
with critical and costly com- 
ponents, and tailor your 
product functionality to 


your exact needs. 


To learn more about digital 
video and digital conver- 
gence technologies, and 
how FPGAs can help with 
them, visit the Emerging 
Standards and Protocols 
(eSP) Web portal at 
www.xilinx.comlesp. ‘This 
website was developed by 
Xilinx as a resource for the digital system 
design community and is specifically target- 
ed at dealing with these new and challenging 
technologies. To date, segments have been 
deployed for home networking, Bluetooth, 
and digital video technologies, and more are 
on the way. 
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Figure 9 - Maximizing the value of programmable logic 
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N of lo J ago,” 
the idea of creating an 
ASIC on the desktop 
was considered a 
novel concept. 
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by Robert Bielby 

Sr. Director, Strategic Solutions 
Xilinx, Inc. 

robert. bielby@xilinx.com 





Seventeen years ago, Xilinx developed the 
industry’ first Field Programmable Gate 
Array (FPGA). The concept was quite sim- 
ple — using Static Random Access Memory 
(SRAM) as the basis of the logic fabric, 
you could instantly develop integrated 
logic circuits directly on your desktop. The 
benefits were far reaching, because this 
technology did away with the risk and 
delay typically associated with designing 
custom Application Specific Integrated 
Circuits (ASICs). 


While the concept of “user programmable” 
logic was perhaps first pioneered by enter- 
prising engineers who used Programmable 
Read Only Memories (PROMs) as a “pro- 
grammable ASIC”, this solution was able 
to achieve only small levels of integration. 
The cost efficiency and logic density 
achieved by using PROMs as custom logic 
was extremely poor compared to ASICs. 
Interestingly, however, the concept of using 
look-up tables to implement logic func- 
tions is actually a basic technology used in 
today’s multi-million gate FPGAs. 
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Today, FPGAs have evolved into a main- 
stream technology because their current 
densities and performance compete with 
custom ASICs. Additionally, designing for 
an FPGA is almost exactly the same as an 
ASIC — your circuit can be described 
either by schematic, or by a high-level 
hardware description language such as 
VHDL or Verilog. This description is 
then synthesized for the FPGA with the 
final result being a configuration bit- 
stream that ultimately programs device 
look-up tables that are interconnected by 
programmable wires. Just like the look-up 
tables, the configuration bitstream also 
determines the connectivity for the pro- 


grammable wires. 


Because the programmability is based 
upon volatile, SRAM technology, there is 
a requirement to “re-write” the configura- 
tion bit stream every time power is 
removed and re-applied to the FPGA. 
This configuration bit stream is usually 
contained in an external, memory which 
is dedicated for configuration, or by 
reserving a portion of a main system 
memory with the FPGA configuration 


information. 


When FPGAs were first introduced, the 
fact that they required programming from 
an external memory source, and that they 
lost their configuration once power was 
removed, was generally considered a lia- 
bility. Fixed ASICs had none of these 
requirements or added complexities. And 
for the most part, ASICs were still signifi- 
cantly cheaper than FPGAs — at least on a 


unit cost basis. 


However, for applications that shipped a 
relatively low number of units, it could be 
shown that the high non-recurring 
charges for the ASIC more than offset the 
higher unit costs of the FPGA. Because of 
these cost constraints, the use of FPGAs 
was historically relegated to the lower vol- 
ume, less cost sensitive applications that 


had large-scale integration requirements. 


Over time, the additional benefits of being 
first to market and the ability to fix bugs or 
accommodate late specification changes 


rose in importance, and the decision to 


Spring 2002 


select an FPGA over an ASIC became 
common. This proved to be true even in 
cases where it could be demonstrated that 
an ASIC solution provided a lower overall 
solution cost than the FPGA. However, 
there were practical limits to how great the 
unit volumes or disparity in price would 


become before a design would transition 


from FPGA to ASIC. 


These fundamental benefits of being first 
to market and of risk aversion have proven 
to be especially important to the network- 
ing and communications industries — as 
demonstrated by the fact that currently 
70% of the $5 billion market for pro- 
grammable logic is consumed by network- 
ing and communications applications. 
FPGAs have been particularly important 
in these markets because they have been 
able to keep up with the pace and innova- 
tion in those markets and accommodate 
the wide range of new standards that con- 


tinuously emerge. 
Flexibility for the Masses 


Moore’s Law, which says that transis- 
tor density of an ASIC will double 
every 18 months due to advances in 
semiconductor technology, _ has 
played a key role in driving larger 
density FPGAs with greater features 
and performance. This has in turn 
continued to fuel the increased demand 
and consumption of FPGAs in network- 


ing and communications applications. 


The advances in semiconductor technolo- 
gy have not only yielded larger and faster 
FPGAs, they have also yielded FPGAs 
that are 10,000% cheaper than they were 
five years ago! The result is that FPGAs, 
which were previously practical only for 
prototyping or low volume, high-end 
applications, are now appearing in some 
of the hottest, newest high-volume con- 


sumer electronics. 


Four years ago, Xilinx developed the 
Spartan series family of FPGAs that were 
optimized for low-cost, high-volume 
applications. The results are impressive; 
from MP3 players, to DVD writers, digi- 


tal cameras, digital modems, and a host of 


other consumer electronics — FPGAs have 
rapidly become a key driver behind the 


digital consumer revolution. 


And reprogrammability, which used to be 
considered a liability, is now clearly seen 
as a key benefit in not only getting a 
product to market sooner, but keeping it 
longer in the market by providing the 
ability to upgrade it and add new features 
once its in the field. But how and why 
programmable logic is being used is 
sometimes just as interesting as where it’s 


being used. 
ReplayTV — Personal Video Recorder 


Personal video recorders (PVRs) are per- 
haps one of the most exciting consumer 
products to offer new features and capa- 
bilities made possible only by the combi- 
nation of digital technologies and FPGAs. 
With a PVR, traditional analog video pro- 
gramming is converted to digital using 


MPEG 2 encoding and stored directly to 





a hard drive — enabling instant search 


access and high quality video imaging. 


PVRs represent a quantum leap in capa- 
bility and quality compared to traditional 
videocassette recorders (VCRs). For exam- 
ple, a PVR such as ReplayTV’s can store 
as much as 60 hours of programming, 
allowing viewers to watch programming 
on their personal schedules instead of 


those set by broadcasters. 


The ReplayTV also contains an integrated 
56K-baud modem, which is used to 
download the equivalent of a TV guide 
that can be searched, sorted, and grouped 
like traditional database programs. This 
allows for easy recording setup and unique 
programming search capabilities. This 
modem connection also enables addition- 


al unique capabilities — such as reconfig- 
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uring the FPGA. The reprogrammable 
logic is updated simply by downloading a 
new bitstream. If the unit is already 
installed in a customer's home, the bit- 
stream is downloaded from ReplayTV’s 
Internet server. This allows bugs to be 
fixed even after the customer takes the 
unit home, and lets ReplayTV add new 
features as necessary. Obviously, this 
extends the life of the product too, 
because rather than having to replace it as 
market requirements change; the cus- 
tomer simply has the unit’s logic repro- 


grammed via the modem. 


ReplayTV reprogramming takes place in 
the background: each evening the PVR 
automatically downloads TV schedule 
information from the company server, 
along with any bug fixes, operating system 
updates, or program modifications. The 
customer's unit is improved as he sleeps 
without intervention on his part. End users 
are typically never made aware of changes 
or fixes to their systems unless the func- 
tionality of the PVR changes as new fea- 
tures are added. 
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In one case, ReplayTV found itself forced 


to deal with a condition that caused 
degraded video quality in a few systems 
already in homes — one of the chips in the 
system had an undocumented clock 
threshold switching problem that varied 
as a function of lot processing. Because 
the control signals for this device were 
generated in the FPGA, it was possible to 
eliminate the problem by changing the 
timing of the FPGA-generated signal. 


The company responded with a software 
change that was uploaded to all ReplayTV 
systems in the field as soon as it was 
debugged and certified. Most customers 
never realized that the change had been 
made; though some may have noticed 
improved video quality. A revolutionary 
capability enabled by FPGA programma- 
ble technology! 


KB Gear — JAMCAM 3.0 
Digital Camera 


Perhaps one of the least likely places one 
would expect to find an FPGA would be in 


a toy digital camera. But time to market pres- 
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sures and the risk of missing the narrow 
Christmas window of opportunity drove KB 
Gear Interactive, a manufacturer of Internet 
communications products and _ interactive 
gear for young computer trekkers, to chose 
the low cost Spartan™ FPGA because of its 
affordability and design flexibility. 


As a testament to the viability of FPGAs in 
such a cost sensitive consumer product — 
the JAMCAM 3.0, which cost just $99, 
was sold in over 14,000 retail outlets 
including Best Buy, Target, K-Mart, 
WalMart, Circuit City, and a host of other 
highly recognizable consumer retail out- 
lets. Because KB Gear had designed a toy 
that had broad appeal and used a Spartan 
FPGA, they were able to deliver one of the 
“must have” toys for the 2000 Christmas 
season on the shelves in time for the 
Christmas season. Years ago, the thoughts 
of an FPGA in a $99 kid’s toy would have 


seemed impossible. 
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Conclusion 


Seventeen years ago the idea of creating an 
ASIC on the desktop was considered a 
novel concept; they were primarily limited 
to prototyping and low volume produc- 
tion. The fact that the FPGAs needed to be 
programmed every time they were powered 
up was considered both a liability and a 
risk. Today, because of their lower cost and 
high flexibility, FPGAs have found a home 
in a wide range of applications. Xilinx 
FPGAs are now used in a wide range of 
high-volume, cost sensitive applications, 
especially those that require reprogramma- 
bility to meet the continuously changing 
standards and demands of the new digital 


consumer markets. 
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Through our XPERTS program, Xilinx certifies 
Syste many 3rd party design centers specializing in terabit 
applications. By taking advantage of industry-unique features such as the 
Selectl/O"-Ultra technology in the Virtex®-II Platform FPGA, Xilinx and its 
XPERTS partners provide the fastest and most flexible high-bandwith solution 
Supporting rapidly evolving connectivity standards. 


Our XPERTS partners have extensive experience using the popular Xilinx 
PCI 32/33,64/66 and PCI-X cores, and are ready to take on the challenges 
of designing with multi-gigabit serial interfaces using the 3.125 Gbps serial 
I/O technology in the next generation Virtex-II family. 
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SystemlO addresses all aspects of system connectivity 
Xilinx provides the most comprehensive SystemlO solution to address 
your interface needs ranging from “inside-the-box” to wide area network 
(WAN) applications. 


Using Virtex-I1 840 Mbps LVDS performance and an abundance of memory 
and logic resources, Xilinx is able to provide solutions for 1OGE MAC, PCI 
& PCI-X, RapidlO, POS-PHY Level 3 and 4, Flexbus 4, HyperTransport, 
and other source-synchronous bus standards. 


With SystemlO and our XPERTS program, you're better connected with Xilinx. 


For more information visit http:// www.xilinx.com/ xperts/systemio.htm 
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by Marco Pavesi 

Head of Innovative Designs Laboratory 
ltaltel SpA 

Marco.Pavesi@Italtel.it 


System verification is the major bottle- 






neck in the development of contempo- 
rary and future system-on-a-chip (SoC) 


Improve VOUT mney ae designs. Increasing numbers of applica- 
fo market with rapid | 


prototyping and system ~ 
veifcation enabled» 


tions that process large quantities of data 
in real time (such as telecom and video) 
require verification techniques that run at 
or near real-time speeds. Delaying your 
software development until working 


devices are available is not a viable option 





aul in the face of enormous competitive time- 


to-market pressures. 


Thus, early hardware/software (HW/SW) 
co-verification is not just practical — it’s 
essential. At Italtel SpA, we and our part- 
ners have developed a superior method of 
improving system verification confidence 
through high-speed register transfer level 
(RTL) prototyping and the Virtex™ fam- 
ily of FPGAs. 


Prototyping at Real Time Speeds 


RTL prototyping allows you to run system 
hardware and software at speeds high 
enough to hunt and find hidden bugs. Just 
as important, you can confidently evaluate 
subjective characteristics, such as audio 


and video quality. 


RIL prototyping uses off-the-shelf FPGAs 
to implement SoC custom logic, as well as 
test physical, real devices like memories, 


interfaces, and processors that comprise the 





other parts of the system to be verified. 
This strategy enables us to create, in effect, 

i a clone of the SoC and all the parts of sys- 
“wil tem that it interfaces with. We call this 
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assembling a “demonstrator.” With a 
demonstrator, we can map the hardware 
and run the application software. RTL pro- 
totyping can be roughly divided in two 


types, custom and modular: 


e Custom prototyping is the fastest tech- 
nique, in terms of frequen- 
cy. FPGAs and 


devices are assembled on a 


other 


board expressly designed 
for the demonstrator to be 
verified. Such a demonstra- 
tor may reach speeds as 
high as 200 MHz — but it 
requires several months 
from initial system design 
to the end of the verifica- 
tion process because of the 
intrinsic delay of the board 


fabrication process. 


Modular prototyping is not 
quite as fast as custom pro- 
totyping for verification — 
but when it comes to giving 
you the time-to-market 
advantage, it is, by far, the better solu- 
tion. With modular prototyping, you can 
assemble FPGAs and other devices on 
general-purpose daughterboards that 
allow you to create a wide range of differ- 
ent demonstrators. Modular prototyping 
is a HW/SW co-development scheme 
based on a set of configurable carrier 
boards where daughterboards can be 
inserted and interconnected as necessary 


during the co-design process. 
Avoiding the FPIC Dead End 
Modular RTL prototyping platforms have 


intrinsic speed limitations related to modu- 
larity, accessibility, and routability. Field 
programmable interconnect chips (FPICs) 
have been the traditional solution for 
routability problems. Unfortunately, the 
existing technology trends for FPICs are 
insignificant (in terms of speed and the 
number of I/Os) compared to the skyrock- 
eting speed and size of Virtex-II and 
Virtex-II PRO™ FPGAs. In short, no 
modular rapid prototyping platform based 
on FPIC technology can meet your cus- 


tomers’ need for speed and time to market. 
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To approach real-time system speeds, a 
modular rapid prototyping tool with new 
interconnection technologies and a novel 
topology had to be developed. Such a tool 
— with single-ended, point-to-point signals 
— would have a physical speed limit ranging 
from 80 MHz to 100 MHz. 





Figure 1 - Co-verification solution: Italtel FlexBench hardware with Temento Diaflex software 


Given these parameters, three European 
companies collaborated for two years to 
design and develop the FlexBench™ mod- 
ular rapid prototyping tool suite: 


¢Italtel SpA, the largest Italian telecom 
manufacturer, is the leader for hardware 


development. 


¢ Mistel SpA, another Italian telecom man- 


ufacturer, supports the hardware effort. 


e Temento Systems, based in France, is 
dedicating its electronic test automation 
(ETA) software resources to the testing of 


SoCs and electronic boards. 


Moreover, Oktet Ltd., a design service 
company based in St. Petersburg, Russia, 
was also deeply involved in the develop- 


ment the FlexBench verification system. 


Putting Together FlexBench Hardware 
As the FlexBench project leader, I had 


been searching for a practical means of 
modular rapid prototyping in a HW/SW 
co-design environment. In 1999, two 


new technologies emerged that enabled 


our team to create and patent the 
FlexBench concept: 


1. QuickSwitch™ bus switches from 


Integrated Device Technology (IDT), Inc. 


2. Mictor™ high-speed connectors from 


Tyco Electronics. 


These two technological 
innovations gave our team 
the means of reconfiguring 
multiple printed circuit 
boards (PCBs) as needed 
during the HW/SW co- 
design process. Employing 
the reliable and fast-grow- 
ing Xilinx FPGA technolo- 
gy, we envisaged it was pos- 
sible to design the unique 
FlexBench HW/SW. tool 


suite. 


The FlexBench challenge 
was to create a rapid proto- 
typing tool so general in 
purpose as to become the 
industry standard. Taking 
such a concept and turning it into a high- 
powered tool, however, is quite an under- 
taking that requires strategic alliances and 


adequate funding. 


We applied for — and received — start-up 
funding from the European Commission. 
With that funding, we worked in a frenet- 
ic manner for two years to gather the 
expertise and to execute the rapid proto- 


typing tool we envisioned. 


At the hardware level, the FlexBench rapid 
prototyping platform was to be basically a 
set of complex board designs. Fortunately, 
we had the PCB know-how and related 
resources in-house at Italtel — and valuable 


contributions from Mistel. 


On the software side, the development of 
the computer assisted engineering (CAE) 
software we needed was not our area of 
competency. Therefore, we entered into a 
partnership with Temento Systems, a com- 
pany known for its excellent ETA software. 
Temento developed the DiaFlex™ soft- 
ware tool we needed to monitor the 


FlexBench system in action. See Figure 1. 
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Our acquisition of Certify™ RTL parti- 
tioning software from industry-leading 
Synplicity™ Inc. completed our primary 


software tool set. 


Now that we had the boards and the soft- 
ware, we had to find the best programmable 
logic devices. It was a short search. In a class 
by themselves, Virtex-II Platform FPGAs 
from Xilinx perfectly fit the needs of our 
verification engineers: high speed, fast and 


simple compilation, and high capacity. 


You can see the FlexBench design chain in 


Figure 2. 


Multi-FPGA 


Partitioning Synplicity Certify 


Per-FPGA 
P&R 


Xilinx 





Figure 2 - FlexBench design chain 


FlexBench Rapid Prototyping at 100 MHz 


As we said earlier, to achieve the speed 
edge in RTL prototyping with a modular 
tool, the FPIC approach is simply not 
suitable. A novel technology had to be 
introduced, based on the speed of IDT’s 
QuickSwitch bus switches. By using pass- 
transistors, you can select interconnec- 
tions among neighboring HW modules 


on multipurpose panels. 


We built QuickSwitch bus switches and 
receptacles into the FlexBench mother- 
boards, which are populated by daughter- 
boards named FlexPlugs. These modules 
are based on an open standard form fac- 
tor. Using advanced, off-the-shelf inter- 
connect technology, FlexPlug modules 
make available up to 888 functional sig- 
nals. FlexPlugs host active devices, such as 
the leading edge Virtex-II 6000 Platform 
FPGA. The FlexPlugs also host power 
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converters. Memories are allocated on 
small modules, named MiniPlugs, direct- 
ly inserted into FlexPlugs in order to min- 


imize interconnect delays. 


We obtained the FlexBench speed charac- 
teristics by interconnecting different 
FlexPlug modules through the distributed 
network of pass-transistors. Each side of a 
FlexPlug contacts the side of the nearest 
other FlexPlugs by means of channels. 
Channel size is configurable, through the 
JTAG port, from 0 to 222 functional 
wires with 8-bit granularity. Trace length 
minimization is achieved through a clever 


three-dimensional architecture. 


The relatively short trace length of 
QuickSwitch delays (0.1 nanosecond) 
and the best-in-class set-up and clock-to- 
output parameters of Virtex-II FPGAs 
permits us to achieve 100 MHz clock 
speed (and over) on modular systems 


populated by up to 18 FPGAs. 


The FlexBench rapid prototyping equip- 
ment (RPE) is composed of a rack, a 
backplane (FlexPanel), a software-con- 
trolled clock generator (FlexClock), some 
carrier boards (FlexMothers), and several 
modules (FlexPlugs and MiniPlugs). 
FlexMother boards are 
through = flexible = printed circuits 
(FlexCable) and through the FlexPanel. 
See Figure 3 to see how the whole Flex 


connected 


hardware family interacts to form the 
FlexBench verification system. 
Figure 4 shows a populat- 

ed FlexMother. 


Figure 3 - FlexBench 
reconfigurable proto- 


typing system 
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Integrating DiaFlex Software 


Complementing the FlexBench hardware is 
the DiaFlex software tool developed by 
Temento Systems. Dedicated to configuring, 
downloading, verifying, and controlling the 
FlexBench RPE, the DiaFlex program suite 
allows you to design the RPE configuration 
through an intuitive, three-dimensional 
graphical user interface. Starting from RPE 
configuration, the DiaFlex application gener- 
ates a flattened description of the platform 
definition into a Verilog format, called a VB 





a 


Figure 4 - 
Populated FlexMother 


file. The VB file provides the Certify verifica- 
tion software with an easy view of the RPE 
resources and connectivity. Moreover, it pro- 


vides direct channel configuration bitstreams . 


The Certify program compiles RTL code, 
and spreads it according to the VB file 
description of the RPE. The Certify tool also 
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synthesizes all FPGAs. The FPGAs 
are then placed-and-routed via pro- 
prietary tools to produce FPGA con- 
figuration bitstreams. 


The DiaFlex application downloads 
all bitstreams to the RPE and provides 
interactive debugging through a 
TemTag™ JTAG PCI board. 
Moreover, the DiaFlex software pro- 
vides diagnostics, reports errors, and 
analyzes test results. This allows 
debugging at the signal name level, 
functional testing, and automatic 
management of IEEE 1149.1 test 
bench generation. Additionally, the 
DiaFlex program takes complete con- 
trol of the FlexClock board. This 
enables you to configure FlexClock 
parameters: the frequency of clock syn- 
thesizers, the selection of global clock 
sources, and the control of other tim- 


ing functions. 
A Flexible Prototyping Library 
To perform effective rapid prototyp- 


ing, you must have a set of FlexPlugs 
and MiniPlugs. These plugs enable 
you to interconnect state-of-the-art 
FPGAs, memories, I/Os, processors, 
design platforms, and other compo- 


nents of your design. 


While Italtel and our affiliates have 
designed general-purpose FlexBench 
modules, design-specific modules 
must developed by final OEM cus- 
tomer. This is where the services of a 
company like Oktet Systems can 
become essential. 


Sixteen general-purpose modules 
currently comprise the FlexBench 
library. Some of the modules are dis- 


played in Figure 5. 
A Matter of Observability 


Compared with the “observability” you can 
get with emulators, the classic argument 
against rapid prototyping has been that you 


(<9 


cant “see” what's going on inside the 
design, because invasive physical probing 1s 
difficult, if not impossible, to accomplish 


in modular equipment. 
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Figure 5 - FlexBench library 


vides the same functional controls 
of a sophisticated logic analyzer. 
With ChipScope ILA, you spend 
less time verifying chip function- 
ality, and therefore, speed up your 


time to market. 


By means of the ChipScope ILA, 
you get full visibility into the 
FlexBench RPE. ChipScope ILA 
lets you see every selected node in 
every FPGA used in your design. 
This virtually eliminates the need 
for invasive physical probing of 
the FlexBench RPE. With the 
Xilinx ChipScope ILA solution, 


you can perform real-time, on- 


Target FPGA with ILA Cores 


JTAG 


JTAG 
Connection 





Figure 6 - ChipScope and FlexBench solution 


Xilinx has removed this weakness thanks to 
its Integrated Logic Analysis (ILA) core, a 
solution that provides trigger and trace cap- 
ture capability within the FPGA itself. 
Implemented by the Xilinx ChipScope™ 
Analyzer, the ILA core allows real-time 
access to any node in the FlexBench chips. 


An easy-to-use graphical user interface pro- 








Target FPGA with ILA Cores 


Target FPGA with ILA Cores 


chip debugging. Figure 6 shows a function- 
al representation of how ChipScope ILA 
facilitates the debugging of a modular 
FlexBench design. 


Conclusion 


Currently, some functional FlexBench plat- 
forms have been developed and are being uti- 
lized in Italtel development and innovation 
projects. Regardless, the FlexBench mission is 
not to become a proprietary tool, but to 


become an industrial standard. 


Italtel/Temento are and will be engaged 
in trials with major silicon vendors to 
demonstrate and to prove the FlexBench 
technology. We believe the versatile 
FlexBench model is a major step in rapid 
prototyping and system verification. We 
are proud to have designed it. To ensure 
best diffusion of this tool, we are now 


researching the best sales channel. 
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Multiprotocol Modular Engineer 
Solutions Plattorm Otters L 


The Nu Horizons Engineering Solutions Platform presents a 
costetfective alternative to single-board tools for developing 
high-end data communication and server applications. 


by Bill Pratt 

Regional Applications Manager 
Nu Horizons Electronics 
Bpratt@NuHorizons.com 


Today's one-sided trend toward shorter sys- 
tem development cycles and increasingly 
complex designs are driving the need for 
more timely system validation. Although 
designers recognize this weighted ratio, and 
are turning more and more to Xilinx 
FPGAs for their designs, they have also 
recognized a need for development boards 


that are up to the challenge. 


Nu Horizons Electronics Corp. under- 
stands this short-cycle/high-complexity 
design ratio and has introduced a line of 
hardware development platform tools that 
facilitate system verification within an inte- 
grated environment. The Nu Horizons 
Engineering Solutions Platforms are based 
on a modular design concept that allows 
the re-use of expansion cards with other 
interface expansion cards to create a unique 


development evaluation environment. 
Modular Design Solution 


The Engineering Solutions Platform solu- 
tion consists of up to three boards in a mod- 
ular environment. With this modularity, you 


may combine a number of different expan- 
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Virtex-II daughtercard 


sion cards for multiple solutions, or you can 
build a total solution in a modular environ- 
ment. The days of purchasing a develop- 


ment board for a single application are gone. 





Engineering Support Platform motherboard 


Motherboard 


The primary function of the Engineering 
Solutions Platform motherboard is to 
provide a high-speed backplane for 
board-to-board data transfers. This back- 
plane is called the Horizon Bus. With the 
high-density board-to-board connectors, 
you can evaluate the functions of two 


expansion cards, which allows emulation 










hry 


of a total solution. The Horizon Bus has 


an aggregate bandwidth of 26 Gbps, and 
is highly suited for developing solutions 
for the telecommunications and data net- 


work markets. 


The main board, or motherboard (Figure 1), 
has the following features: 


e Xilinx XCR3128XL -10PQ VQ100 
CoolRunner™ CPLD 


¢ High-density board-to-board connectors 


for two expansion cards 
¢ Dual seven-segment LCD display 
e RS-232 level shifter for UART core 


instantiation 
¢ JTAG configuration headers 


¢ Two 44-pin PLCC sockets for Xilinx 
configuration PROMs 


¢ Fifty test points 
¢ Power management 
¢ Reset controller 


e Status LEDs. 
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Virtex-II Expansion Card 


The Xilinx Virtex™-II expansion card 
(Figure 2) is a cost-effective development 
board with a broad suite of features. 
Available now is the XC2V1000 FG456 
one-million-gate Virtex-II device with the 
following features as implemented on 

the Engineering Solutions Platform 


daughtercard: 
e 16 channels of LVDS interface 


— User-configurable as eight trans- 
mit and eight receive or 16 


transmit channels 








¢ Multiprotocol serial clock/data 
transceivers with auto cable termi- 


nation 


— Certified TBR-1, TBR-2, NET-1, 
and NET-2 compliant. 


¢ 16 megabits of synchronous 


NBT SRAM 
— Flow through and pipelined 


e Programmable clock from 1 MHz 
to 200 MHz 


e Six user clock inputs 


¢ High speed 133-bit at 200 MHz 
backplane 


¢ DCI (digitally controlled 


impedance). 


Design Platform for 
High-End Applications 


Targeted at high-end data communi- 
cations applications, the Virtex-II 
daughtercard is applicable to many 
scenarios, including SPI-3 to CSIX 
bus interface or offering transparency 
within high-end networking solutions. The 
16 channels of high-performance LVDS 
(low voltage differential signaling) provide 
the capability to prototype high-speed dif- 
ferential interfaces such as the SPI-3 and 
SPI-4 bus technologies. The Receive port 
includes parallel termination; the Transmit 


side does not require this feature. 


The multiprotocol clock/data transceivers 
with auto cable termination can support 


multiple protocols such as two channels of 
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V.35 HDLC (high-level data link control). 
Coupled with the HDLC core from Xilinx, 
you can turn your Virtex-IJ FPGA into a 
powerful network engine. The multiproto- 
col clock/data transceivers also support 
X.21, V.11, RS-232, RS-449, and RS- 
53W\ serial interfaces. 


DIES 0) F\Nf 


Jumpers, 
Pin Headers 








CoolRunner 


BOE Regulators 


Driver 


Configuration PROMS 


Figure I - ESP motherboard block diagram 


LVDS Header, 
Pin Headers 


Reset 





SUL trey al 







Virtex-ll 
XC2V1000 







MSPC Voltage 


Chipset 


Figure 2 - Virtex-II daughtercard block diagram 


NBT (no bus turnaround) RAM offers a 
standard packaging for high-speed SSRAM 
devices. The Virtex-II daughtercard has 16 
MB of NBT SSRAM from GSI Technology, 
which allows for zero-wait, read-write bus 
utilization. This GSI SSRAM is program- 
mable as FT (flow through) or pipelined, 
operating as a pipelined synchronous device. 
This means that in addition to the rising- 
edge triggered registers that capture input 
signals, the device incorporates a rising-edge 


triggered output register. 






The daughtercard also incorporates an 
ICS525-02 user-configurable clock chip 
from Integrated Circuit Systems Inc. The 
ICS525-02 is the most flexible way to 
generate a wide range of highly accurate 
clock output from a standard crystal or 
clock oscillator. You can easily program 
the ICS525-02 to output a frequency 
from 1 MHz to 200 MHz by setting a 
bank of switches located on the board. 
You may also supply a LVDS clock, as 
well as four other single-ended user 
supplied clocks, for a total of six usable 
clock inputs. 


With the Horizon Bus backplane, the 
Virtex-II daughtercard becomes a very 
powerful development platform offer- 
ing up to 26 Gbps throughput and 
allowing the instantiation of a multi- 
tude of bus architectures, such as CSIX 


for telecommunications. 


Nu Horizons Electronics Corp. is in the 
process of developing new daughter- 
cards for the Engineering Solutions 
Platform. Future cards include an 
HTML server/white appliance con- 
troller and a high-speed AFE (analog 


front end) with an Ethernet interface. 
Conclusion 


The Engineering Solutions Platform 
from Nu Horizons Electronics Corp. is 
a modular Virtex-II development envi- 
ronment that provides designers with 
the hardware necessary to develop, pro- 
totype, verify, and test their designs 
before they are finalized. Intended to 
accelerate the design cycle timeline, the 
Engineering Solutions Platform is a 
modular solution that enhances and sim- 
plifies system design and_ validation 
processes. Its modularity makes it highly 
functional over many design cycles, thus 
providing a cost-effective alternative to sin- 
gle-board tools for developing high-end 
data communication and server applica- 
tions. The Engineering Solutions Platform 
motherboard with the Virtex-II daughter- 
card sells for $999 and is available for 
immediate shipment. For more informa- 


tion, please log onto www.nuhorizons.com. 
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Plug in your high-densit FPGA 
ug in your high-density : 
And get ready. You’ve never seen performance like this. Introducing Amplify® Physical Optimizer” — the first and 
only physical synthesis solution for programmable logic. Now you can achieve aggressive performance goals, and save 
weeks while you’re at it. Easy to learn and use, Amplify delivers better Quality of Results by utilizing both physical and 
timing constraints during synthesis. As a result, designers are achieving up to 35% performance gains, on or ahead of 
schedule. Plus, Amplify supports and enhances Team Design. Not only does it manage the physical hierarchy of 


a design, Amplify optimizes performance, regardless of how a design is allocated across the team. Get in on the 
outstanding performance. For more information and an evaluation copy, visit www.synplicity.com/amplify. 


able World 2002, and attend our upcoming webcast seminar featur; 
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Parallel Cable IV” 
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onnects Faster and Better 


This new high-speed download cable supports ultra-low voltages. 


by Theresa Vu 

Product Marketing Engineer 
Xilinx, Inc. 

theresa. vu@xilinx.com 


The new Parallel Cable IV downloads data 
more than 10X faster than the previous 
JTAG/Parallel Cable III. Now, you can 
download to one XC2V8000 Virtex!™-II 
FPGA in less than eight seconds. 


The PCIV features ultra low-voltage 
support for all Xilinx FPGA, 
CPLD, System ACE™ MPM 


(multi-package module), and 


ISP (in-system programmable) 
PROM devices (see Table 1). 


The PCIV connects to any desk- 

top or laptop computer using the standard 
IEEE 1284-compliant parallel port and 
draws power directly from the computer 
(through the mouse/keyboard port) or an 
external power supply. 


A robust ribbon cable ships with the PCIV, 


Parallel Cable IV 
Download Speed 


Download Modes 


Power Supply 


Board Connections Ribbon Cable and Flying Wires 





1/0 Voltage Support 


A ieee! Geo Gn! | 


which gives you the flexibility to use either 
the JTAG (EEE 1149.1) or Slave Serial 
download mode at the fastest speeds. The 
small profile of the ribbon cable connector 
minimizes the need for board space. The 
ribbon cable offers an error-free, quick con- 
nect target interface compared to cumber- 


some flying lead wires of Parallel Cable III. 


The PCIV is backward compatible with 
the PC III and offers a connector for 
flying lead wires. 


PCIV extends Xilinx leadership 
in pre-engineered configura- 
tion solutions by offering a fast, 
simple, low-cost download 
solution for all Xilinx FGPA, CPLD, 
System ACE MPM, and ISP PROM 
devices. The Parallel Cable IV will be 
available in late March through Xilinx 
distributors and the Xilinx e-commerce 
site for $95. For more information, see 


www.xilinx.com/support/programr/cables. htm. 


Parallel Cable III 
up to 300 Kbps 
2.5V, 3.3V, 5V 
JTAG (IEEE 1149.1) and Slave Serial 
Target System 
Flying Wires 
iMPACT 





Software Support iMPACT 


Table 1 - Parallel Cable IV vs. Parallel Cable IIT Parallel Cable IV with ribbon cable connector 
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by Aki Niimura 

Consultant 

Ponderosa Design 
ponderosa_design@pacbell.net 


As the complexity of FPGA-based systems 
grows every year, we are asked to imple- 
ment larger, more complex functionality 
within tighter schedules. Furthermore, the 
type of design has changed rapidly in 
recent years. People used to design an 
FPGA taking an existing board design, 
often containing asynchronous clocks. 
Those days are over. You can not design 
today’s Platform FPGA just by extending 
yesterday's design practices. New designs 
often require the implementation of com- 
plex sequences or communication proto- 
cols. The finite state machine (FSM) is a 
well-known design methodology to imple- 
ment such sequences. FSM is very effective 
when the sequence is not very complex. 

However, implementing a 


complex sequence using 








Complexity ; 
an FSM is not prac- 


tical and is often 
difficult to 


maintain. 


Microcontroller 


Microsequencer 


Performance 


Figure 1 - Performance versus 
functional complexity 
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implement complex protocols. However, 
they require substantial resources (mem- 
.), which can be difficult 


to justify in real-life situations. On the 


Ory, Cost, pins, .. 


other hand, software implementations 
allow designers to cope with mounting 
logic complexity. They are easy and quick 


to implement and easier to maintain. 


There is a gap, however, between the 
range of design complexity that FSM 
methodology can handle and what 
microcontroller-based methodology is 
good for, as shown in Figure 1. As micro- 
controllers become more powerful, the 


gap is widening. 


The scc-// is not just another set of 
microcontrollers. We specifically con- 
structed the microsequencer to fill the 
gap between low-level FSM solutions 


and high-level microcontroller designs. 
scc-I] — A Configurable Microsequencer 


Sequencers have been used in many LSI 
projects to implement functions. For 
CISC 


(Complex Instruction Set Computer) 


example, instructions in a 
CPU were often implemented in this way 
(called microcode, which is written in a 
proprietary assembly language). By allow- 
ing users to write programs in a high-level 
language, the scc-/7 can accommodate a 


wider range of FPGA applications. 


een — 


C controller would be overkill, 


ig 


” E 








The key architectural benefits of the 


scc-J// are: 

¢ Small footprint 

¢ High-level language support 

¢ Small code size 

¢ Configurable and customizable 

¢ Capable of handling 16-bit and 32-bit 
data types 

¢ Timer (integrated into the core 
architecture) 

¢ Support of interrupt handling 

¢ Developing and debugging tools 

oa OlulivzveCoyeMoyep,Gliveyanyjertacvemnucgs m-tere! 


Virtex!™-IT devices. 


A block diagram of the scc-// is shown in 
Figure 2. The core itself requires 400 to 600 
LUTs, depending on the configuration and 


synthesis constraints. 
How the scc-/7 Works 


The scc-// employs a stack-based architec- 
ture. Stack computers use data stacks to 
evaluate given operations (Figure 3). The 


benefits of stack-based architecture are: 


¢ High-level language ready — can execute 


syntax tree directly 


¢ Simple hardware — easy to understand, 


easy to customize 


e Small instruction code — most scc-// 


instructions are one byte long. 
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Another unique aspect of the scc-//architec- 
ture is the use of register windows. Register 
windows are used to pass arguments to a 
function being called. Because the scc-// 
does not use a stack frame in memory to 
pass arguments, the scc-// does not require 
data memory to run a high-level language 
program, thus making the scc-// more 


attractive for Platform FPGA applications. 


Programs for the scc-// are almost entirely 


written in the high-level language SC. 
The Language SC 


The scc-// assumes the use of a high-level 
language. However, existing high-level lan- 
guages are not designed for microsequencer 
applications. Therefore, we developed a 
stripped-down version of C language — SC. 
SC programs do not support “struct” and 
other complex data types, but SC has sever- 
al enhancements to describe control appli- 
cations efficiently. Timer, I/O, and debug 


features are natively supported in SC. 


The following is a code fragment from a 


project that controls an SDRAM memory. 


void 
init_sdram() 
{ 
while(!eval_cond(DLL_RDY)) { } 


wait(14); // 286 uS 
outp(SDCMD, SD_PRE); 
repeat(8) { 

outp(SDCMD, SD_AREF); 

} // tRC = 84nS; 20nS * 5 
outo(SDCMD, SD_MODE); 


In the above code fragment, eval_cond(n), 
wait(n), and outp(port) are not function calls, 
but they are natively supported by SC. Note 
that the loop counter of the repeat state- 
ment is placed in the data stack and not in 


the register file. 
scc-IT Target Applications 


The scc-// can be used in designing func- 
tional blocks to perform procedural con- 


trol. For example, flow charts or simple 
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// wait for DLL is ready 


// looping takes 7 cycles 
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ROM 


Interrupt 
logic 


Figure 2 - scc-II block diagram 
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push a 


Figure 3 - How stack computers operate 


arithmetic functions (such as averaging) 
are good candidates for implementation 


by the scc-l. 
Other target applications include: 


¢ Read/write flash, 
EEPROM, 1-Wire™ 


devices 


¢ Interface to I2C, 
RS-232, Ethernet, 
USB1.1, IrDA 


¢ Command interpreter 
(a block is controlled 


through commands) 


e User interface (such 
as keypad, LCD, 
touch panel) 


¢ Servo controller (some 
arithmetic operations 


required) 
¢ Design that requires many variants. 
The scc-// and Virtex FPGAs 
The Virtex family of FPGAs are true “sys- 


tem on a chip” platforms. The advanced 
technology available in Virtex-II devices 
provides further attractive features to the 


scc-/T, including: 


¢ The scc-// can run at 70 MHz or faster. 
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¢ Larger Block RAM allows larger program 
sizes (up to 8 KW). 


e Native multiply operators are supported. 


Case Study — Web on FPGA 


To demonstrate the effectiveness of the 
scc-/T solution, we have developed a Web 
server that uses less than 25% of the 
resources of a  Spartan-II FPGA 
(XC2S150). The only additional hardware 
required beside the FPGA are an Ethernet 
PHY device and a signature ROM (option- 
al). We found that the Spartan-II VoIP 
Development Kit from Insight Electronics 


(www. insight-electronics.com) included all 








Figure 4 - WebThermo demo application 


the hardware components we needed. Thus, 
we decided to use this off-the-shelf board to 
create our Web server design. Figure 4 


shows a screenshot from a Web server proj- 
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ect called WebThermo. The screen displays 
the current temperature every minute. 
Table 1 shows utilization statistics from a 


Synplify analysis of the XC2S150 device. 


WebThermo Usage of XC2S150 Note 


828 (23%) Synplify 7.0 


for TX, RX buff 
Block RAM a 
5 for Program 


320 (18%) 


Table 1 - WebThermo logic size 


The 2 KB WebThermo program imple- 
ments all Ethernet, TCP/IP and Web 
(HTTP) protocols, as well as Celsius-to- 
Fahrenheit conversion. At start up, the pro- 
gram retrieves a unique 48-bit ID code 
from a Dallas 1-Wire device (DS18S20), 
which is used as an Ethernet MAC address. 
For further details on the WebThermo proj- 


ect, please visit home.pacbell.net/akineko/. 
Program Development 


One challenge of the scc-// solution is in 
providing reasonable program development 
and debugging tools. Figure 5 illustrates a 
typical program development flow. In addi- 
tion to key software tools, we wrote many 
scripts and templates to automate the 
design process. While creating several proj- 
ects with the scc-//, we refined the RTL 
design, as well as the development software 
and scripts. As a result, they have become 


mature and stable. 


Currently the development environment is 
supported under Unix. It is also possible to 
port some of the tools to Windows platform 
using Cygwin from Cygnus (RedHat). The 
tools are developed assuming that the user's 


RTL design is in Verilog HDL. 
Debugging, Then Debugging Again 


Debugging is the biggest challenge in devel- 
oping an scc-// based design. We are pro- 
viding several debugging aids: 


1. Debugging starts with simulation: 


— Three debug instructions (print, $dump, 
$stop) 


— Self-checking embedded in the code 
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— Execution trace log generation 


— Dis-assembler to display current con- 
text (on-the-fly/offline) 


2. Ready to try on the board: 


— JTAG debugger to download program 
without backend (synthesis + PAR) 


— UART customized for debugging (one 
can use printf()) 


— “xdl” script to replace ROM contents 
without backend. 


Lessons learned: 


1. Logic simulation is always the best tool 


for debugging. 


2. printf() is a primitive but very powerful 
means for debugging. 


3. Use #ifdef ... #else ... #endif to switch 
between debug and release. 


4. A bigger vehicle is needed for debug- 
ging (you may need 2 KB to develop a 
1 KB program). 


JTAG debugger 


The JTAG debugger (jtagdbg) has proved 
to be a powerful tool to facilitate the debug- 
ging process. The JTAG debugger uses the 
Virtex USER1 JTAG command to commu- 


Technology Fos MT 


nicate with a Virtex FPGA. By substituting 
the instruction ROM block in the scc-// 
design with a JTIAG embedded ROM 
block, you can perform several debug com- 
mands, such as downloading a program 
without going through the FPGA backend 
design process. No signal change is 
required, as JAG signals are hidden from 
your RTL code. 


Conclusion 


We have presented a microsequencer, the 
scc-//, which is new to conventional FPGA 
design practices. Unlike other IP cores, the 
potential of the scc-// is not limited to its 
original form. Rather, the scc-// can evolve 
to meet each application challenge. One 
avenue we plan to explore is adding Galois 
instructions to the original scc-// core. This 
enhancement can help in error correction 


or security applications. 


Another avenue we plan to pursue is project 
automation, such as a wizard script that sets 
up project directories and tools — and then 
creates a skeleton version of RTL code, as 
well as skeleton SC program and header files. 


The complete scc-// design solution 
is offered by Ponderosa Design 


in Sunnyvale, California Please write 


ponderosa_design@pacbell.net or visit 
http://home.pacbell. netlakineko/. 





Figure 5 - The scc-II program development flow 
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Your FPGAs are getting as complex as ASICs. 


But are your FPGA tools ASIC enough? 





FPGA applications are evolving 
into something more complex, 
more challenging - more ASIC-like. 

That's why Synopsys offers a suite of 
proven tools for your advanced FPGA 
designs. With synthesis, simulation, 
Static timing, and more, you'll get a 
comprehensive solution that can help 
you achieve high-performance results 
ona tight product schedule. Start your 
evaluation at synopsys.com/fpga. And 
see what the leader in ASIC design and 
verification can do for your FPGA. 


See us at the Xilinx 
Programmable World 2002 
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FPGA Compiler II’s unique algorithms aid 
in designing chips correctly and on time. a 


by Jackie Patterson 
Director ot Marketing Programs 
Synopsys, Inc. 
jackiep@synopsys.com 


When you target Virtex-II Pro™ Platform 
FPGA, you are using one of the best 
FPGAs on the market — and you 
need a high-performance synthe- 
sis tool to match. That's why 
Synopsys Inc. is focusing efforts 
on its premier FPGA Compiler 
II™ FPGA synthesis tool and 
continuing to hone its support 
for top-of-the-line programma- 
ble logic devices. In 2002, FPGA 
Compiler II is superceding 
FPGA Express™, which will be 


discontinued. 


This article will explain the differences 
between the two synthesis tools, summarize 
key features carried forward from FPGA 
Express to the FPGA Compiler H, and 
explain how you can upgrade to FPGA 
Compiler II today. 


FPGA Compiler Il Has Unique Capabilities 
FPGA Compiler H (FCI) was developed 


for high-performance devices such as 
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Figure 1 - FCI adds flexibility to 
high-performance synthesis engines. 


Virtex-II Pro Platform FPGAs. FCII com- 
bines the architecture-specific synthesis 
engine of FPGA Express with advanced 
technologies suitable for ASIC-like design 
challenges, as shown in Figure 1. These 
leading-edge capabilities are unique to 
FPGA Compiler IJ and not found in any 


other synthesis tools on the market. 


Common 
Synthesis 
Technology 


Register Retiming 


FCII uses sophisticated retiming algo- 
rithms to boost clock speed automatically. 
Retiming works by analyzing all the com- 
binational logic in the design, and then 
selecting the optimal register placement to 


meet your design goals. See Figure 2. 


Retiming can also pipeline your design 


automatically. Just code in the number of 


5 Tool to Maximize ® 
ertormance 


FPGA Compiler II 


Register Retiming 
and Pipeline Insertion 


db Output 


dc _ shell 


Translator 
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register banks to match your latency 
requirements, and FCII does the rest, mov- 
ing the registers into the optimal position 


to form your pipeline. 


Register retiming is easy to use in FCII. All 
you have to do is set the retiming variable 
in your script, and FCI will automatically 
position your registers in the optimal places 


to maximize clock speed. 


current_chip my_chip 
set_chip_retiming -enable 


You can also use the FCII graphical 
interface to invoke retiming, if you 
prefer. Just select Edit->Constraints 
from the menu and check the 
retiming box in the Xilinx vendor 
options tab. Synopsys full-chip 
retiming optimizes performance of 
all datapath, control, and random 
logic in the high end of the Xilinx 
line — the Virtex™, Virtex-II, and 


Virtex-II Pro families of devices. 
DesignWare Foundation IP Library 
ASIC designers save design time by 


reusing components from the Synopsys 
DesignWare Foundation IP library. 
Although some components in the library 
are also useful for production FPGA 
designs, the biggest benefits come for 


those who are using a Virtex-II Pro device 
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FCII improves both 
datapath and control logic 
while preserving 
pin-to-pin functionality. 


FCII allows easy 


Inputs Outputs 


pipeline_design 


automatic pipelining; 
insert registers, 
then apply timing 10 ns => 


Circuit After FCII Register Retiming 


Inputs Outputs 


Circuit After FCII Register Retiming 





Inputs Outputs 


Figure 2 - FCII register retiming optimizes performance. 


to prototype an ASIC or SOC. For a pro- 
totype, FCII’s ability to implement the 
DesignWare components in the FPGA 
provides confidence that the ASIC will 
work as planned, because it allows you to 
verify exactly the same IP that is used in 
the ASIC. 


Full Flow Support 
FPGA Compiler IL allows designers to take 


advantage of other high-performance tools 
for FPGA design and verification. When 
undertaking large, complex Virtex-II Pro 
devices, designers need to verify the silicon 
before it gets into the lab. That process 
starts with a quick analysis of the source 
code using the LEDA® HDL checker. The 
LEDA checker leverages general-purpose 
rules along with Xilinx-specific rules to 
alert designers to potential problems and 
optimization opportunities in the HDL. 
Then the HDL functionality is verified 
using a fast simulator such as VCS™ 
for Verilog or Scirocco™ for VHDL. 
After the optimization by FPGA 
Compiler II, designers can use 
Formality® to formally verify the 
design, avoiding time-consuming sim- 
ulation runs. Of course, the timing of 
the design is comprehensively checked 


through static timing analysis. In this way, 


Spring 2002 


Figure 3 - Synopsys design flow 
for Virtex-IT Pro Platform FPGA 


Synopsys FCII customers can leverage all 
available means to get their designs done 


correctly and on time. 


Architecture-Specific Synthesis Engine 


FPGA Compiler II carries forward the 


same architecture-specific 


synthesis 


FPGA Synthesis 


engine you've relied on in FPGA Express. 


This includes: 


¢ Support for all Xilinx devices through 
Virtex-II Pro and beyond 


¢ Block Level Incremental Synthesis (BLIS) 
to speed your design cycle time by re- 
doing only the blocks in the design that 
have changed 


¢ User control of register duplication to 
eliminate critical paths caused by high 


fan-out nets 
¢ Networked licenses on both UNIX and PC 


e Integrated schematic viewing and tim- 


ing analysis 
¢ Full TCL scripting 
¢ ROM inference. 
Upgrading to FPGA Compiler II 


FPGA Compiler II is the choice for syn- 
thesis of high-performance FPGAs such as 
the Virtex-II Pro pf1. 


To upgrade to FPGA Compiler II, contact 
your Synopsys account manager or visit 


WWW. SYNOPSYS. com/fpga. 
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| 0) Multi-Gigabit 
to Double Your FPGA Bandwidth ye 


Virtex-ll Pro Platform FPGAs break open the 1/0 bottleneck. 


by Brian Von Herzen, Ph.D. 

CEO, Rapid Prototypes, Inc. 

a member of Xilinx XPERTS Program — 

Xilinx Worldwide 3rd Party Certitied Design Centers 
Brian@FPGA.com 


As FPGAs increase in size and performance, 
I/O resources become the main bottleneck 
to FPGA performance. Although the effec- 
tive area of a chip grows as the square of the 
feature size, the perimeter I/Os grow only 
linearly. State of the art designs require 


higher performance I/O modules. 


In response to this increasing demand on 
I/O resources, Xilinx has developed novel 
I/O structures called Rocket I/O™ multi- 
gigabit transceivers (MGTs) that enable 
order-of-magnitude increases in I/O per- 
formance. The Rocket I/O MGTs double 
the total I/O bandwidth of the Virtex-II 
Pro™ family of devices using only a few 


percent of the pins. 


With up to 16 MGTs per device, the Virtex- 
II Pro achieves an additional 100 gigabits per 
second of I/O bandwidth in the larger 
devices over what is available with the gener- 
al-purpose I/O blocks. Rocket I/O MGTs 
enable multiple gigabit I/O standards and 
maximize performance for FPGA-to-FPGA 
communications. Even though Rocket I/O 
MGTs dramatically increase performance for 
demanding applications, they are easy 
enough to use for simple FPGA-to-FPGA 
communications with special soft macros 
such as the Aurora core available from Xilinx. 
The interface has been simplified to the 
extent that no external resistive termination 
is required with the Rocket I/O MGTs. The 
transceivers can be internally configured to 


match 50Q. or 75Q, transmission lines. 
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Multipliers and 


MGTs Onboard 
The Rocket I/O MGTs are shown in 


Figure 1, which illustrates the overall 
Virtex-II Pro architecture. The MGIs are 
located above and below columns of Block 
RAM, providing close availability of Block 
RAMs for ingress and egress FIFOs. As 
many as 16 MGTs are integrated on each 
FPGA above and below the Block RAM 
columns. The clock distribution networks 
can feed these transceivers for low-skew 
clock alignment between MGTs. 


Rocket I/O 
DCM High-Speed Transceiver 


S 
aim (AN 
= fs) 
| 
eo ee | 
4 
3 
aa) 


ta c.> i 
thar 
iil 


Select I/O - Ultra 





Figure 1 - Virtex-II Pro FPGA architecture, 
including up to 16 Rocket 1/O MGTs 


MGTs = Multiple Gigabit Standards 
The Rocket I/O MGTs have been designed 


to be compliant with: 

¢ Gigabit Ethernet 

¢ 10 Gigabit Ethernet XAUI 
¢ Fibre Channel 

¢ InfiniBand™ Architecture 


e Xilinx Aurora core 











ngineering 
hird Parties 


Configurable hardware support is 
provided for: 


¢ 8B/10B encoding 
¢ Disparity control 


e Transmitter and receiver termination 


impedance 
¢ Pre-emphasis 
¢ Amplitude control 
¢ Loopback testing. 


These are the essential configurable hard- 
ware features that enable compliance with 
all of the main multi-gigabit signaling stan- 
dards. You can find more information on 
Virtex-II Pro standards support on the 


Xilinx website at www.xilinx.com/partin- 


fo/databook.htm. 
MGTs Speed up FPGA Communications 
Although MGTs have many important 


applications interfacing to industry stan- 
dard gigabit communications protocols, 
they can serve another important function 
in boosting the bandwidth available 
among FPGAs. 


Figure 2 shows a data communications 
application requiring two FPGAs. For a 
typical implementation, half of the I/O 
bandwidth is allocated to external links and 
half to the inter-FPGA link. With 800 gen- 
eral-purpose I/O pins available, running at 
a typical speed of 250 MHz, the pair of 
FPGAs can support a pipeline throughput 
of 100 gigabits per second. 


If the 16 bidirectional MGT links are 
added, 16 links x 3.125 gigabits are avail- 
able in each direction, for another 100 


gigabits per second total. Figure 3 shows 
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Figure 2 - FPGA processing pipeline using 
general purpose I/O (GPIO) only 


100 Gbps more inter-FPGA bandwidth from Rocket I/O MGTs 
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Total of 200 Gbps between FPGAs 


Figure 3 - FPGA processing pipeline using 
Rocket I/O multi-gigabit transceivers and 
general purpose I/O (GPIO) 
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Up to 20 inches of 50 or 75 ohm 
PC board transmission line 


Figure 4 - Rocket I/O MGTs configured 
to pre-emphasize data transmission over PCB 
traces up to 20 inches long 


the system diagram for two Virtex II Pro 
devices including the Rocket I/O MGTs 
between the devices. With Rocket I/O, the 
total bandwidth available between the two 
FPGAs increases to 200 gigabits per sec- 
ond. This extra bandwidth is extremely 
useful in switching applications that may 
require up to 100% internal overhead to 
handle control protocols over and above 
This 2X 


increase is an example of the dramatic I/O 


the datapath requirements. 


performance increase available in Virtex-II 


Pro Platform FPGAs. 
Aurora Boosts FPGA-to-FPGA Links 


Xilinx has developed a software macro 
called the Aurora core that provides easy 16- 
bit and 32-bit interfaces from FPGA-to- 
FPGA using one or more Rocket I/O 
MGTs. The Aurora core handles the fram- 
ing, synchronization, and channel bonding 
tasks, allowing you to focus more on their 
application. A single MGT can provide a 
16-bit or 32-bit FPGA-to-FPGA interface. 
The software Aurora core coupled with 
hard core (implemented in silicon) MGTs 
can create powerful serial-to-parallel and 


parallel-to-serial tranceivers. 
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Alternatively, multiple MGTs can be bond- 
ed together to form a higher-bandwidth 
interface. The Aurora macro ensures that 
channel-bonded data will appear on the 
same clock cycle at the other end of the 
communications link. The 8B/10B encod- 
ing method is used for the Aurora soft 
macro, giving an effective bandwidth of 10 


gigabits per second for a set of four channel- 


bonded MGTs between two FPGAs. 
Pre-Emphasis Goes the Distance 


Another important feature of the Rocket 
I/O solution is pre-emphasis, which com- 
pensates for the filtering effects of FR-4 PC 
board material at gigabit speeds. Pre- 
emphasis boosts the output levels to com- 
pensate for the filtering effects of extended 
PCB traces. With pre-emphasis, PCB runs 
of 20 inches or longer can be supported relli- 


ably at speeds of 3.125 gigabits per second. 


Figure 4 shows how two MGTs on separate 
FPGAs can be easily linked up to 20 inches 


No Pre-Emphasis 








Figure 5 -Received signals after 50 inches FR-4 PC 


board material with no pre-emphasis and with 
30% pre-emphasis in the Rocket I/O MGT. 


apart using standard PC board transmis- 
sion-line traces. Pre-emphasis improves the 
noise and jitter performance at these high 
speeds for longer PC traces where disper- 


sion affects timing and voltage margins. 


You can clearly see in Figure 5, which shows 
oscilloscope traces at the receiver without 
pre-emphasis and with 30% pre-emphasis. 
The pre-emphasis enables reliable signal 
transmission at 3.125 Gigabits per second 
over long PC board traces in standard FR-4 
PC board material. Transmitter and receiv- 
er termination is provided internally to each 
MGT, eliminating the need for external ter- 
mination of these transmission lines. The 
termination impedance can be set to 50 


ohms or to 75 ohms. 
Conclusion 


The Rocket I/O MGTs enable high-speed 
interfaces for Virtex-II Pro Platform FPGAs. 
Standards such as InfiniBand Architecture, 
Fibre Channel, XAUI, and Gigabit Ethernet 
are directly supported. Inter-FPGA commu- 
nications are greatly enhanced by Rocket 
I/O modules and the Aurora soft macro, 
which enable simple FPGA links with chan- 
nel-bonded performance of over 10 gigabits 
per second. Using pre-emphasis, PCB trace 
runs of 20 inches or greater are possible at 
speeds of up to 3.125 gigabits per second per 
link. In the larger Virtex-II Pro devices, this 
represents a factor of two increase in total 
I/O performance available per FPGA. 
Clearly, Virtex-II Pro Platform FPGAs with 
Rocket I/O MGITs give you a competitive 
advantage, both in terms of performance 


and time to market. 


The Xilinx XPERTS Program 
The Xilinx XPERTS program identifies 


engineering firms around the world 

who have demonstrated significant 
expertise in developing Xilinx FPGA- 

based electronic products and solutions. 
For more information on how Rapid 
Prototypes Inc. can meet your high- 
performance FPGA application 
requirements, please visit 
www.EPGA.com. 
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Insight Electronics recently introduced a 


complete solution for developing designs 
and applications based on the new high- 
performance, ultra-low power Xilinx 
CoolRunner™-IIT CPLD family. The 
Insight CoolRunner-II Development Kit 
enables experimentation with CoolRunner- 
II design con- 
cepts, including 
the investigation 
of multiple I/O 


standards, clock 


UNBEATABLE COMBINATION OF Hlth PERFORMANCE AND 
ULTRA LOW POWER THAT COOLRUNNER-II DEVICES OFFER. 


— STEVE SHARP, SENIOR MANAGER OF SILICON SOLUTIONS MARKETING. 


management, 
security, and 
CPLD power 


consumption. 


“Releasing our 

CoolRunner-II Development Kit in tan- 
dem with the Xilinx CoolRunner-II fam- 
ily introduction continues Insight’s suc- 
cessful development kit introduction 
strategy, said Jim Beneke, director of 
Technical Marketing at Insight. “This 
approach gives designers immediate 
access to the devices’ full range of fea- 
tures, allowing them to verify the ultra- 
low power and high performance of the 
Xilinx second generation Fast Zero 
Power™ (FZP) technology as used in the 
CoolRunner-II family.” 


The Insight kit includes the 64 macrocell, 
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“MILINX 1S PLEASED TO HAVE INSIGHT PROVIDING TIMELY SUPPORT 
FOR OUR BREAKTHROUGH COOLRUNNER:|I REALDIGITAL CPLDS, 
THROUGH THIS DEVELOPMENT KIT, USERS CAN EVALUATE THE 





Se New Coolkunnerl CPLD 
A Zt Development Kit 


a 


1.8V CoolRunner-II device; multiple 
options for power supplies, I/O voltages, 
and clock sources; a JTAG port; 45 user 
I/Os; a prototyping area; a two-digit LCD 
display; and two push-button switches. 
The Xilinx ISE WebPACK™ software 
contains all the development software you 
need, and can be 
downloaded at no 
cost from the Xilinx 
website, or is includ- 
ed in the WebPACK 


version of the kit. 


“Xilinx is pleased to 
have Insight provid- 
ing timely support 
for our breakthrough 
CoolRunner-II RealDigital CPLDs,” stat- 
ed Steve Sharp, senior manager of Silicon 
“Through this 


development kit, users can evaluate the 


Solutions Marketing. 


unbeatable combination of high perform- 
ance and ultra low power that 


CoolRunner-II devices offer.” 
Price and Availability 


Available for order now, Insight’s 
CoolRunner-I Development Kit is priced 
at $95. For more information on this and 
other Xilinx design kits available from 
Insight, call 888-488-4133, or go to 


www. insight-electronics.com/coolrunner2. 


An ideal kit for applying CPLD technology 
fo high-performance portable 
and battery-powered systems. 


About Insight Electronics 


Insight Electronics is a member 
of The Memec Group, the world’s 
leading distributor of proprietary 

advanced-technology semiconductors. 

With 52 divisions in the U.S., 

Mexico, Canada, and South America, 
Insight is the largest specialty semi- 
conductor distributor in North and 

South America. Insight focuses its 
line card on leading semiconductor 
companies, enabling their sales team 
and field application engineers to 
devote their time and resources 

to mastery of supplier products, 
processes, and technologies. The 

company’s value-added services, 

design expertise, inventory manage- 
ment, logistics, and e-business tools 
ensure that customers receive the 
products they need, when and 
where they need them. For more 
information, visit the Insight Website 


at Www. sight com. 


Insight 
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CoolRunner-i The RealDigital CPLD is the breakthrough in 


high-performance and low-power you’ve been looking for. Shipping right 
now, the new CoolRunner”-I| family of devices are the only CPLDs offering 
a 100% digital core, eliminating power-hungry analog sense amps. 


A real-world feature set for real-world challenges 
With complete !/O support including HSTL and SSTL, you've got the real-world 
interfacing you need today. Our unique 
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current. It’s fast. It’s cool. It’s available now! 


FREE downloadable software 

Thenew CoolRunne-!| RealDigital CPLDs are fully supported by the easy-to-use 

ISE WebPACK”, downloadable FREE via the Internet. Or choose our ISE 4.1i 
software — the fastest, most advanced 


| tates Sie ee — . development system in the industry, and part 


clock off-chip. And the Clock Doubler = ["DaviceFamily  ecnintnnrra of the total solution offered only by Xilinx. 


gives you speeds up to 400 MHz. The new <100 pA 
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Summary of Virtex-ll Pro Features 

¢ High-performance Platform FPGA solution including : 
- Up to sixteen Rocket I/O™ embedded multi-gigabit 

transceiver blocks (based on Mindspeed's ‘ 

SkyRail™ technology) ° 

- Up to four IBM® PowerPC™ RISC processor Z 

blocks : 

¢ Based on Virtex™-I| Platform FPGA technology . 

- Flexible logic resources . 

-  SRAM-based in-system configuration 3 


- Active Interconnect™ technology 

- SelectRAM™ memory hierarchy 

- Dedicated 18-bit x 18-bit multiplier blocks 

- High-performance clock management circuitry 
-  Selectl/O™-Ultra technology 

- Digitally Controlled Impedance (DCI) I/O : 


The members and resources of the Virtex-Il Pro family are - 
shown in Table 1. 


Rocket I/O™ Features 


¢ Full-duplex serial transceiver (SERDES) capable of baud 
rates from 622 Mb/s to 3.125 Gb/s 

¢ 80 Gb/s duplex data rate (16 channels) 

¢ Monolithic clock synthesis and clock recovery (CDR) if 

¢ Fibre Channel, Gigabit Ethernet, 10-Gbit Attachment Unit ° 
Interface (XAUI), and Infiniband-compliant transceivers z 


Virtex-Il Pro Platform FPGAs: 
Introduction and Overview 


Advance Product Specification 


8-, 16-, or 32-bit selectable internal FPGA interface 
8B/10B encoder and decoder 

50Q / 75Q on-chip selectable transmit and receive ter- 
minations 

Programmable comma detection 

Channel bonding support (two to sixteen channels) 
Rate matching via insertion/deletion characters 
Four levels of selectable pre-emphasis 

Five levels of output differential voltage 
Per-channel internal loopback modes 

2.5V transceiver supply voltage 


PowerPC RISC Core Features 


Embedded 300+ MHz Harvard architecture core 

Low power consumption: 0.9 mW/MHz 

Five-stage data path pipeline 

Hardware multiply/divide unit 

Thirty-two 32-bit general purpose registers 

16 KB two-way set-associative instruction cache 

16 KB two-way set-associative data cache 

Memory Management Unit (MMU) 

-  64-entry unified Translation Look-aside Buffers 
(TLB) 

- Variable page sizes (1 KB to 16 MB) 

Dedicated on-chip memory (OCM) interface 

Supports IBM CoreConnect™ bus architecture 

Debug and trace support 

Timer facilities 


Table 1: Virtex-Il Pro Field-Programmable Gate Array Family Members 


























(1 CLB =4 iest Max 128 bits) Block SelectRAM 
Rocket I/O | PowerPC Maximum | 18 X 18 Bit Max 
Transceiver | Processor Array Distributed | Multiplier | 18 Kbit | Block RAM Max 
Device Blocks Blocks | RowxCol/| Slices | RAM Kbits Blocks Blocks Kbits DCMs | I/O Pads 

XC2VP2 4 0 16x22 | 1,408 4d. 2 12 216 4 204 
XC2VP4 4 1 40x22 | 3,008 94 28 28 504 4 348 
KeE2Vey 8 1 40x34 | 4,928 154 44 44 792 4 396 
XC2VP20 8 2 96x46 | 9,280 290 88 88 1,584 8 564 
XC2VP50 16 4. 68% (OMi22,092 706 216 216 3,888 8 852 
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General Description 


Virtex-Il Pro Platform FPGA Technology 


SelectRAM memory hierarchy 
- Up to 4 Mbit of True Dual-Port RAM in 18-Kb block 
SelectRAM resources 
- Up to 706 Kb of distributed SelectRAM resources 
- High-performance interfaces to external memory 
Arithmetic functions 
- Dedicated 18-bit x 18-bit multiplier blocks 
- Fast look-ahead carry logic chains 
Flexible logic resources 
- Up to 45,184 internal registers / latches with Clock 
Enable 
- Up to 45,184 Look-Up Tables (LUTs) or cascadable 
variable (1 to 16 bits) shift registers 
- Wide multiplexers and wide-input function support 
- Horizontal cascade chain and Sum-of-Products 
support 
- Internal 3-state busing 
High-performance clock management circuitry 
- Up to eight Digital Clock Manager (DCM) modules 
Precise clock de-skew 
Flexible frequency synthesis 
High-resolution phase shifting 
- 16 global clock multiplexer buffers in all parts 
Active Interconnect™ technology 
- Fourth-generation segmented routing structure 
- Fast, predictable routing delay, independent of 
fanout 
- Deep sub-micron noise immunity benefits 
Selectl/O-Ultra™ technology 
- Up to 852 user I/Os 
- Twenty two single-ended standards and 
five differential standards 
- Programmable LVTTL and LVCMOS sink/source 
current (2 mA to 24 mA) per I/O 
- Digitally Controlled Impedance (DCI) I/O: on-chip 
termination resistors for single-ended I/O standards 
- PCI support (designated banks only) 
- Differential signaling 
- 840 Mb/s Low-Voltage Differential Signaling I/O 
(LVDS) with current mode drivers 
Bus LVDS |/O 
HyperTransport (LDT) I/O with current driver 
buffers 
* Built-in DDR input and output registers 
- Proprietary high-performance SelectLink 
technology for communications between Xilinx 
devices 
High-bandwidth data path 
Double Data Rate (DDR) link 
Web-based HDL generation methodology 
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¢ SRAM-based in-system configuration 
- Fast SelectMAP™ configuration 
- Triple Data Encryption Standard (DES) secu- 

rity option (bitstream encryption) 

-  |EEE1532 support 
- Partial reconfiguration 
- Unlimited reprogrammability 
- | Readback capability 

¢ Supported by Xilinx Foundation™ and Alliance™ 
series development systems 
- Integrated VHDL and Verilog design flows 
- ChipScope™ Integrated Logic Analyzer 

¢ 0.13-um, nine-layer copper process with 90 nm 
high-speed transistors 

¢ 1.5V (VCCINT) core power supply, dedicated 
2.5V VCCAUX auxiliary and Veco I/O power 
supplies 

¢ IEEE 1149.1 compatible boundary-scan logic 
support 

¢  Flip-Chip and Wire-Bond Ball Grid Array (BGA) 
packages In standard 1.00 mm pitch 

¢ Each device 100% factory tested 


General Description 


The Virtex-Il Pro family is a platform FPGA for designs that 
are based on IP cores and customized modules. The family 
incorporates multi-gigabit transceivers and PowerPC CPU 
cores in Virtex-Il Pro series FPGA architecture. It empowers 
complete solutions for telecommunication, wireless, net- 
working, video, and DSP applications. 


The leading-edge 0.13um CMOS nine-layer copper 
process and the Virtex-Il Pro architecture are optimized 
for high performance designs in a wide range of densities. 
Combining a wide variety of flexible features and IP cores, 
the Virtex-Il Pro family enhances programmable logic 
design capabilities and is a powerful alternative to mask- 
programmed gates arrays. 
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Architecture 


Virtex-ll Pro Array Overview 


Virtex-Il Pro devices are user-programmable gate arrays 

with various configurable elements and embedded cores 

optimized for high-density and high-performance system 

designs. Virtex-Il Pro implements the following functionality: 

¢ Embedded high-speed serial transceivers enable data 
bit rate up to 3.125 Gb/s per channel. 

¢ Embedded IBM PowerPC 405 RISC CPU cores provide 
performance of 300+ MHz. 

¢ Selectl/O-Ultra blocks provide the interface between 
package pins and the internal configurable logic. Most 
popular and leading-edge I/O standards are supported 
by the programmable IOBs. 

¢ Configurable Logic Blocks (CLBs) provide functional ele- 
ments for combinatorial and synchronous logic, includ- 
ing basic storage elements. BUFTs (3-state buffers) 
associated with each CLB element drive dedicated seg- 
mentable horizontal routing resources. 

¢ Block SelectRAM memory modules provide large 18-Kb 
storage elements of True Dual-Port RAM. 

¢ Embedded multiplier blocks are 18-bit x 18-bit dedicated 
multipliers. 

¢ DCM (Digital Clock Manager) blocks provide self-cali- 
brating, fully digital solutions for clock distribution delay 
compensation, clock multiplication and division, and 
coarse- and fine-grained clock phase shifting. 


A new generation of programmable routing resources called 
Active Interconnect Technology interconnects all of these ele- 
ments. The general routing matrix (GRM) is an array of rout- 
ing switches. Each programmable element is tied to a switch 
matrix, allowing multiple connections to the general routing 
matrix. The overall programmable interconnection is hierar- 
chical and designed to support high-speed designs. 


All programmable elements, including the routing resources, 
are controlled by values stored in static memory cells. These 
values are loaded in the memory cells during configuration 
and can be reloaded to change the functions of the program- 
mable elements. 


Virtex-ll Pro Features 
This section briefly describes Virtex-II Pro features. 


Rocket I/O Multi-Gigabit Transceiver Cores 


The Rocket I/O Multi-Gigabit Transceiver core, based on 
Mindspeed’s SkyRail technology, is a flexible parallel-to- 
serial and serial-to-parallel transceiver embedded core used 
for high-bandwidth interconnection between buses, back- 
planes, or other subsystems. 


Multiple user instantiations in an FPGA are possible, provid- 
ing up to 80 Gb/s of full duplex raw data transfer. Each chan- 
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Virtex-Il Pro FPGAs: Introducton and Overview 


nel can be operated at a maximum data transfer rate of 
er || 20) (E/9) cy 


Each Rocket I/O core implements the following function- 
ality: 


Serializer and deserializer (SERDES) 

Monolithic clock synthesis and clock recovery (CDR) 

Fibre Channel, Gigabit Ethernet, XAUI, and 

Infiniband compliant transceivers 

8-, 16-, or 32-bit selectable FPGA interface 

8B/10B encoder and decoder with bypassing option 

on each channel 

Channel bonding support (two to sixteen channels) 

- Elastic buffers for inter-chip deskewing and 
channel-to-channel alignment 

Receiver clock recovery tolerance of up to 75 non- 

transitioning bits 

50Q / 75Q on-chip selectable TX and RX terminations 

Programmable comma detection 

Rate matching via insertion/deletion characters 

Automatic lock-to-reference function 

Optional TX and RX data inversion 

Four levels of pre-emphasis support 

Per-channel serial and parallel transmitter-to-receiver 

internal loopback modes 

Cyclic Redundancy Check (CRC) support 


PowerPC 405 


The PPC405 RISC CPU can execute instructions at a 
sustained rate of one instruction per cycle. On-chip 
instruction and data cache reduce design complexity and 
improve system throughput. 


PPC405 features include: 


PowerPC RISC CPU 

- Implements the PowerPC User Instruction Set 
Architecture (UISA) and extensions for embedded 
applications 

- Thirty-two 32-bit general purpose registers (GPRs) 

- Static branch prediction 

- Five-stage pipeline with single-cycle execution of 
most instructions, including loads/stores 

- Unaligned and aligned load/store support to 
cache, main memory, and on-chip memory 

- Hardware multiply/divide for faster integer arith- 
metic (4-cycle multiply, 35-cycle divide) 

- Enhanced string and multiple-word handling 

-  Big/little endian operation support 

Storage Control 

- Separate instruction and data cache units, both 
two-way set-associative and non-blocking 

- Eight words (32 bytes) per cache line 

- 16 KB array Instruction Cache Unit (ICU), 16 KB 
array Data Cache Unit (DCU) 
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- QOperand forwarding during instruction cache line fill 
- Copy-back or write-through DCU strategy 
- Doubleword instruction fetch from cache improves 
branch latency 
¢ Virtual mode memory management unit (MMU) 
- Translation of the 4 GB logical address space into 
physical addresses 
- Software control of page replacement strategy 
- Supports multiple simultaneous page sizes ranging 
from 1 KB to 16 MB 
¢ OCM controllers provide dedicated interfaces between 
Block SelectRAM memory and processor core instruc- 
tion and data paths for high-speed access 
¢ PowerPC timer facilities 
- 64-bit time base 
- Programmable interval timer (PIT) 
- Fixed interval timer (FIT) 
- Watchdog timer (WDT) 
¢ Debug Support 
- Internal debug mode 
- External debug mode 
- Debug Wait mode 
- Real Time Trace debug mode 
- Enhanced debug support with logical operators 
- Instruction trace and trace-back support 
- Forward or backward trace 
¢ Two hardware interrupt levels support 
e Advanced power management support 


Input/Output Blocks (IOBs) 
lIOBs are programmable and can be categorized as follows: 


¢ Input block with an optional single data rate (SDR) or 
double data rate (DDR) register 

¢ Output block with an optional SDR or DDR register and 
an optional 3-state buffer to be driven directly or through 
SDR or DDR register 

¢ Bi-directional block (any combination of input and output 
configurations) 


These registers are either edge-triggered D-type flip-flops or 
level-sensitive latches. 


lIOBs support the following single-ended I/O standards: 


orem le\TmIile 

¢ LVCMOS (3.3V, 2.5V, 1.8V, and 1.5V) 

¢ PCI (83 and 66 MHz) 

= 7 Gilileand (Giles 

¢ HSTL 1.5V and 1.8V (Class |, Il, III, and IV) 
¢ SSTL3.3V and 2.5V, (Class | and Il) 


The digitally controlled impedance (DCI) I/O feature auto- 
matically provides on-chip termination for each single-ended 
I/O standard. 
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The IOB elements also support the following differential 
signaling I/O standards: 


- LVDS and Extended LVDS (2.5V only) 
-  BLVDS (Bus LVDS) 

- ULVDS 

- LDT 


Two adjacent pads are used for each differential pair. Two 
or four IOB blocks connect to one switch matrix to access 
the routing resources. 


Configurable Logic Blocks (CLBs) 


CLB resources include four slices and two 3-state buffers. 
Each slice is equivalent and contains: 


¢ Two function generators (F & G) 

¢ Two storage elements 

¢ Arithmetic logic gates 

¢ Large multiplexers 

¢ Wide function capability 

e Fast carry look-ahead chain 

¢ Horizontal cascade chain (OR gate) 


The function generators F & G are configurable as 4-input 
look-up tables (LUTsS), as 16-bit shift registers, or as 16-bit 
distributed SelectRAM memory. 


In addition, the two storage elements are either edge-trig- 
gered D-type flip-flops or level-sensitive latches. 


Each CLB has internal fast interconnect and connects to 
a switch matrix to access general routing resources. 


Block SelectRAM Memory 


The block SelectRAM memory resources are 18 Kb of 
True Dual-Port RAM, programmable from 16K x 1 bit to 
512 x 36 bit, in various depth and width configurations. 
Each port is totally synchronous and independent, offer- 
ing three “read-during-write” modes. Block SelectRAM 
memory is cascadable to implement large embedded 
Storage blocks. Supported memory configurations for 
dual-port and single-port modes are shown in Table 2. 


Table 2: Dual-Port and Single-Port Configurations 




















16K x 1 bit 4K x 4 bits 1K x 18 bits 
8K x 2 bits 2K x 9 bits 512 x 36 bits 
18 X 18-Bit Multipliers 


A multiplier block is associated with each SelectRAM mem- 
ory block. The multiplier block is a dedicated 18 x 18-bit 2s 
complement signed multiplier, and is optimized for opera- 
tions based on the block SelectRAM content on one port. 
The 18 x 18 multiplier can be used independently of the 
block SelectRAM resource. Read/multiply/accumulate 
operations and DSP filter structures are extremely efficient. 
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Both the SelectRAM memory and the multiplier resource are 
connected to four switch matrices to access the general 
routing resources. 


Global Clocking 


The DCM and global clock multiplexer buffers provide a 
complete solution for designing high-speed clock schemes. 


Up to eight DCM blocks are available. To generate deskewed 
internal or external clocks, each DCM can be used to elimi- 
nate clock distribution delay. The DCM also provides 90, 180, 
and 270-degree phase-shifted versions of its output clocks. 
Fine-grained phase shifting offers high-resolution phase 
adjustments in increments of 1/256 of the clock period. Very 
flexible frequency synthesis provides a clock output frequen- 
cy equal to a fractional or integer multiple of the input clock 
frequency. For exact timing parameters, see Virtex-/l Pro 
Platform FPGAs: DC and Switching Characteristics. 


Virtex-Il Pro devices have 16 global clock MUX buffers, with 
up to eight clock nets per quadrant. Each clock MUX buffer 
can select one of the two clock inputs and switch glitch-free 
from one clock to the other. Each DCM can send up to four 
of its clock outputs to global clock buffers on the same edge. 
Any global clock pin can drive any DCM on the same edge. 


Routing Resources 


The IOB, CLB, block SelectRAM, multiplier, and DCM ele- 
ments all use the same interconnect scheme and the same 
access to the global routing matrix. Timing models are 
shared, greatly improving the predictability of the perform- 
ance of high-speed designs. 


There are a total of 16 global clock lines, with eight avail- 
able per quadrant. In addition, 24 vertical and horizontal 
long lines per row or column, as well as massive secondary 
and local routing resources, provide fast interconnect. 
Virtex-Il Pro buffered interconnects are relatively unaffected 
by net fanout, and the interconnect layout is designed to 
minimize crosstalk. 


Horizontal and vertical routing resources for each row or 
column include: 


¢ 24 long lines 

¢ 120 hex lines 

¢ 40 double lines 

¢ 16 direct connect lines (total in all four directions) 


Boundary Scan 


Boundary-scan instructions and associated data registers 
support a standard methodology for accessing and config- 
uring Virtex-Il Pro devices, complying with IEEE standards 
1149.1 and 1532. A system mode and a test mode are 
implemented. In system mode, a Virtex-Il Pro device will 
continue to function while executing non-test boundary-scan 
instructions. In test mode, boundary-scan test instructions 
control the I/O pins for testing purposes. The Virtex-Il Pro 
Test Access Port (TAP) supports BYPASS, PRELOAD, 
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SAMPLE, IDCODE, and USERCODE non-test instruc- 
tions. The EXTEST, INTEST, and HIGHZ test instructions 
are also supported. 


Configuration 


Virtex-Il Pro devices are configured by loading the bit- 
stream into internal configuration memory using one of 
the following modes: 


¢« Slave-serial mode 

e Master-serial mode 

¢ Slave SelectMAP mode 

¢ Master SelectMAP mode 

¢ Boundary-Scan mode (IEEE 1532) 


A Data Encryption Standard (DES) decryptor is avail- 
able on-chip to secure the bitstreams. One or two triple- 
DES key sets can be used to optionally encrypt the con- 
figuration data. 


The Xilinx System Advanced Configuration Enviornment 
(System ACE) family offers high-capacity and flexible 
solution for FPGA configuration as well as program/data 
storage for the processor. See DS080, System ACE 
CompactFlash Solution for more information. 


Readback and Integrated Logic Analyzer 


Configuration data stored in Virtex-ll Pro configuration 
memory can be read back for verification. Along with the 
configuration data, the contents of all flip-flops/latches, 
distributed SelectRAM, and block SelectRAM memory 
resources can be read back. This capability is useful for 
real-time debugging. 


The Xilinx ChipScope Pro Integrated Logic Analyzer (ILA) 
cores and Integrated Bus Analyzer (IBA) cores, along with 
the ChipScope Pro Analyzer software, provide a complete 
solution for accessing and verifying user designs within 
Virtex-Il Pro devices. 
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IP Core and Reference Support 


Intellectual Property is part of the Platform FPGA solution. 
In addition to the existing FPGA fabric cores, the list below 
shows some of the currently available hardware and soft- 
ware intellectual properties specially developed for Virtex- 
Il Pro by Xilinx. Each IP core is modular, portable, Real- 
Time Operating System (RTOS) independent, and 
CoreConnect compatible for ease of design migration. 
Refer to www.xilinx.com for the latest and most complete 
list of cores. 


Hardware Cores 


¢ Bus Infastructure cores (arbiters, bridges, and more) 
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¢ Memory cores (Flash, SRAM, and more) 
Peripheral cores (UART, IIC, and more) 


¢ Networking cores (ATM, Ethernet, and more) 


Software Cores 

° Boot code 

° Test code 

° Device drivers 

° Protocol stacks 

. RTOS integration 


° Customized board support package 


Virtex-Il Pro Device/Package Combinations and Maximum I/Os 


Offerings include ball grid array (BGA) packages with 1.0 
mm pitch. In addition to traditional wire-bond interconnects, 
flip-chip interconnect is used in some of the BGA offerings. 
The use of flip-chip interconnect offers more I/Os than is 
possible in wire-bond versions of the similar packages. Flip- 
chip construction offers the combination of high pin count 
and excellent power dissipation. 


The Virtex-ll Pro device/package combination table (Table 
3) details the maximum number of I/Os for each device 
and package using wire-bond or flip-chip technology. 


¢ FG denotes wire-bond fine-pitch BGA (1.00 mm pitch). 
¢ FF denotes flip-chip fine-pitch BGA (1.00 mm pitch). 
¢ BF denotes flip-chip fine-ptich BGA (1.27 mm pitch). 


Table 3: Virtex-ll Pro Device/Package Combinations and Maximum Number of 


Available I/Os (Advance Information) 


















































User Available I/Os 
Package Pitch (mm) | Size (mm) XC2VP2 XC2VP4 XC2VP7 XC2VP20 XC2VP50 
FG256 1.00 eax 140 140 
FG456 1.00 PEG) YA fee! ioe 248 248 
Eiore 1.00 21 X27 204 348 396 
FF896 1.00 S10 SL 396 556 
Riri 152 1.00 30 X 30 564 692 
euler 1.00 40 x 40 Soe 
BF957 Weer 40 x 40 564 584 
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The new Spartan-I1E FPGAs deliver over 20 billion The GSP Web Portal ... a Unique Designer’s Resource 
M AC¢s/s, giving you the DSP performance required The Xilinx GSP web portal gives you all the support you could wish for 
for high-resolution digital images. Together with as standards continue to rapidly change. 7 
advanced signaling standards including LVDS, Simply log on to xilinx.com/esp and ci Sesiaiiiaiink prisideiahielieh 
, H STL and SSTL, you've got the speed you need you'll get immediate access to reference 
for digital video processing. boards, system block diagrams, a full soectrum of technology & stan- 
From Pictureto Pixel, the Complete Solution dards tutorials, and much more. You'll soon see why ESP is one of the 
Through capture, process and display, Spartan-IIE FPGAs bring the total most successtul websites of its kind. 


solution into focus. Encryption IP, operating at high speed 
video rates, offers a new level of security protecting every pixel. 
And the desktop programmability of Spartan-IIE FPGAs 
solves your time-to-market pressures as design cycles get shorter. 


iy =fE Visit www.xilinx.com/spartan2e/dvt today and find out 


—___ why Spartan-I1E FPGAs are the clear choice for digital 
; video design. 
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CLB (1 CLB = 4 slices = Max 128 bits) Block SelectRAM 
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CLB Resources BLK RAM CLK Resources I/O Features 


Virtex-II Family — 1.5 Volt 






































































FPGAs 





LDT-25, LVPECL-33, cee 0.4 
80K | 168 | 512 | 1,152_| 1,024 | 16k | 8 | 144k | 8 | 24/420 | 4 | DCM| DCM| YES | 60 | 120 |  LVDS-33, LVDS-25, 0.6M 
250K | 24x16 | 1,536 | 3,456 | 3,072 | 48k | 24 | 432K | 24 | 24/420 | 8 | DCM| DCM| YES | 100 | 200 | LVDSEXT-33, LVDSEXT-25, 1.7M 
500K | 32x24 | 3,072 | 6,912 | 6144 | 96K | 32 | 576K | 32 | 24/420 | 8 | DCM|DCM| YES | 132 | 264 | BLVDS-25, ULVDS-25, 2.8M 
iM | 40x32 | 5,120 | 11,520 | 10,240 | 160K | 40 | 720K | 40 | 24/420 | 8 | DCM|DCM| Yes | 216 | 432 | — LVTTL, LVCMOs33, 4.1M 
15M | 48x40 | 7,680 | 17,280 | 15,360 | 240K | 48 | 864K | 48 | 24/420 | 8 | DCM| DCM| YES | 264 | 528 | LVCMOS25, LVCMOS18, B|5 |_5.7M 
2M | 56x48 | 10,752_| 24,192 | 21,504 | 336K _| 56 | 1008K | 56 | 24/420 | 8 | DCM| DCM | YES | 312 | 624 | LVCMOS15, PCI33, PCI66, 7.5M 
3M | 64x56 | 14,336 | 32,256 | 28,672 | 448K_| 96 | 1728K | 96 | 24/420 | 12 | DCM| DCM | YES | 360 | 720 | PCI-X, GTL, GTL+, HSTL, 10.5M 
912 | HSTL I, HSTL Ill, HSTL WV, 15.7M 
SSTL21, SSTL211, 21.9M 
am | 112x104 | 46,592 | 104,832 | 93,184 | 1456K | 168 | 3024K | 168 | 20/420 | 12 | DCM | DCM | YES | 554 | 1108 | SSTL3|,SSTL3 I 29.1M 
Virtex-E Family — 1.8 Volt .18um Six Layer Metal Process 
| 72k | 16x24 | 768 | 1,728 | 1,536 | 24K | 16 | 64k | NA | 25/350 | 8 | YES | YES | NA | 88 | 176_| 0.6M 
| 128K | 20x30 | 1,200 | 2,700 | 2,400 | 37.5k | 20 | 80K | NA | 25/350 | 8 | ves | YES | NA | 98 | 196 | 0.9M 
306K | 28x42 | 2,352_| 5,292 | 4,704 | 73.5K | 28 | 112K | NA | 25/350 | 8 | YES | YES | NA | 142 | 284 | — LVTTL, LVCMOS2, 145M 
A12K | 32x48 | 3,072 | 6,912 | 6144 | 96K | 32 | 128K | NA | 25/350 | 8 | YES | YES | NA | 158 | 316 |  LVCMOS18, PCI33, 1.88M 
570K | 40x60 | 4,800 | 10,800 | 9,600 | 150K | 40 | 160K | NA | 25/350 | 8 | YES | YES | NA | 202 | 404 |  PCI66, GTL, GTL+, 2.7M 
986K | 48x72 | 6,912 | 15,552 | 13,824 | 216K | 72 | 288K | NA | 25/350 | 8 | YES | YES | NA | 256 | 512 | HSTLI, HSTL II, HSTL Wy, 5 |_3.97M 
1,569K | 64x96 | 12,288 | 27,648 | 24,576 | 384K | 96 | 384K | NA | 25/350 | 8 | YES | YES | NA | 330 | 660 |  SSTL31, SSTL3Il, 6.6M 
-2,188K_| 72x 180 | 25,920 | 34,992 | 51,840 | 486K | 144 | 576K | NA | 25/350 | 8 | YES | YES | NA | 362 | 724 | SSTL21, SSTL211, BLVDS, 8.4 
-2,542K | 80x120_ | 19,200 | 43,200 | 38,400 | 600K | 160 | 640K | NA | 25/350 | 8 | YES | YES | NA | 402 | 804 | LVDS, LVPECL 10.2M 
3,264K | 92x 138 | 25,392 | 57,132_| 50,784 | 793.5K | 184 | 736K _| NA | 25/350 | 8 | YES | YES | NA | 402 | 804 13M 
474K | 104x156 | 32,448 | 73,008 | 64896 | T014K | 208 | 832K | NA | 25/350. 8 | YES | YES | NA | 402 | 804 16.3 
Virtex-EM Family — 1.8 Volt -18um Six cen Te Process 
131M | 40x60 | 4800 | 10,800 | 9,600 | 150K | 140 | 560K | NA | 25/350 | 8 | YES | YES | NA | 202 | 404 Same As 67-8 5 3.43M 
Spartan-IIE Family — 1.8 Volt .18um Six <a 1s Br 
| 50K | 16x24 | 768 | 1,728 | 1,536 | 24k | 8 | 32k | NA | 25/320 | 4 | ves | YES | NA | 84 | 182 | LVITLWCMOs2,WOMOsIg | -6- 0.6M 
100K | 20x30 | 1,200 | 2700 | 2400 | 37.5K | 10 | 40K | NA | 25/320 | 4 | YES | YES | NA | 86 | 202 | PCIB3,PCI66,GTLGTL+, | -6- ran 0.9M 
HSTL | HSTL Il, HSTLIV SSTL3 | 11M 
SST13 I SSTL2|SSM12 I, AGP-2K 1.4M 
300k | 32x48 | 3,072 | 6912 | 6144 | 96K | 16 | aK | NA | 2520 | 4 | YES | YES | NA | 120 | 329 |CTTLVDS,BLVDS,WWPECL 19M 
Spartan-Il Family — 2.5 Volt .22/.18um pare ae Metal Process 
| 15K | 8xi2 [192 | 432 | 384 | eK | 4 | 16K _| NA | 25/200 | 4 | ves | ves | NA | NA | 86 | LvTTLivcMos2, |_-5-6 | 5 0.2M 
| 30K | 12x18 | 432, | 972 | 864 | 135K | 6 | 24K | NA | 25/200 | 4 | ves | YES | NA | NA | 132 | PCI33(33VA5V), |_-5-6 0.4 
| 50K | 16x24 | 768 | 1,728 | 1,536 | 24k | 8 | 32K | NA | 25/200 | 4 | Yes | YES | NA | NA | 176 | PCI66(3.3V), GTL,GTL+,|_-5 -6 a, |e |_0.6M 
100K | 20x30 | 1,200 | 2700 | 2400 | 37.5K | 10 | 40K | NA | 25/200 | 4 | YES | YES | NA | NA | 196 | HSTLI,HSTLII,HSTLIV, | -5-6 ~ | 2 |_0.8M 
260_| SSTL3I, SSTL3 Il, SSTL2 |, |_-5 -6 11M 
200K 28 x 42 2352 a2 o2 4,704 735K 14 56K NA | 25/200 | 4 | YES | YES | NA NA 284 SSTL2 Il, AGP-2X, CTT B-6 =) 1.4M 
Note: 1. System Gates include 20-30% of CLBs used as RAM Important: Verify all Data with Device Data Sheet 
2. A Logic Cell is defined as a 4 input LUT and a register 
DCM — Digital Clock Management 
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Xilinx FPGA Package Options and User |/0 


User Available I/Os 
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MISTER lime) ico sian 
S10 3535 | 
S100 fax | 





SPARTAN-II 


Yo EX-E SPARTAN-IIE 





Yr 














22 





PQFP Packages (PQ) 
146 | 146 | 146 | 146 | 146 132} 140 140 | 140 | 140 
158 | 158 | 158 | 158 | 158 | 158} 158 
HQFP Packages (HQ) 
| ea) a | | | | 
144 rrr Beef 6 | jor joo] | 
BGA Packages (BG) — wire-bond standard BGA (1.27 mm ball spacing) 


208 
| 240 rr 58} | | fase} Pt 
Chip Scale Packages — wire-bond chip-scale BGA (0.8 mm ball spacing) 
ae 

Eile SEaie ir 

SRE 

||| [328 |392}aos] || 

ze 





eA 88 |120 | 200 |264 | 432 |528 | 624 | 720 | 912 | 1104 | 1296 
240 
VQFP Packages (VQ) 
is 53 en aa oF | esel Can Md Eel cand dN cn) 2m ead (Ga al Oem Nine 20a fa 
TQFP Packages (TQ) 
| 144 SA SE a a a 
FGA Packages (FT) — wire-bond fine-pitch thin BGA (1.0 mm ball spacing) 
56 ooo 
FGA Packages (FG) — wire-bond fine-pitch BGA (1.0 mm ball spacing) 


176 |176) 176) 176 176) 176 | 176) 176 
P [200 calaza] [|| Pe eee 


| | | | 392 ]456|4ea] | a 
ff f5t2}512|st2j5i2) || 
| | | | [660/660 }660) | | 
| | | | |st2}660|700} | || [556 


660 | 724 | 804 | 804) 804 








FFA Packages (FF) — flip-chip fine-pitch BGA (1.0 mm ball spacing) 
896 432 528 624 Note: Virtex-Il packages FG456 and FG676 are footprint compatible. 


1152 oe 824 Virtex-Il packages FF896 and FF1152 are footprint compatible. 


1517 912 | 1104! 1108 Important: Verify all Data with Device Data Sheet 


BFA Packages (BF) — flip-chip fine-pitch BGA (1.27 mm ball spacing) 
P| || 684 68 ox 








Numbers indicated in the matrix are the maximum number of user 
I/O's for that package and device combination. 
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Xilinx CPLD Product Selection Matrix 


PRODUCT SELECTION MATRIX 







XC9500XV Family — 2.5 Volt 


"800 | 35/90) 182583 | 18253 |36/1/35/3-45-7| 7 3, 
‘iso0 | 72/90 12563 | 1ansaa|n|t|4|457| 7 [3[8 
3200 | 144/ 90 | 1.92.5/3.3 | 197253 |117[2/4| 45-7 | 7 [3/18 
‘s400 | 288) 90 1.2583 | 182533 [192|4] 5 [57-10] 10 1318 
XC9500XL Family — 3.3 Volt 

reco [36/90] 25835 | 2583 36) | 5 |-5-7-10|-7-10 (3) 18 
ricoo | 72 [90) 25835 | 2503 |72) | [57-10] 7-10 (318 
rs200 [44/90 258.35 | 2503 |u| [55-70 [-7-0 [3 [18 
's400 | 28/90] 25835 | 2583 [192) | 6|-6-7-10|7-0 [318 
CoolRunner XPLA3 — 3.3 Volt 

70 [32]a8) 335 | 33 (36) [5-57-10 | 7-10 [4] 16 
1500 | 64 [48] 33 | 33° [68 | | 6 | 6-7-10 | 7-10 |4| 16 
ra000 [128] 48) 335 | 33 (roe) | 6|-6-7-10| 7-10 [4] 16 
-s000 [256] 48/335 | a3 164] _|75)-7-10-12|-10-12 4 
9000 |3a/ 4a] 33 | 33 [220| |7.5|-7-10-12|-10-12 4 
2000 si2| 48) 33/5 | 33 (260) _|75|-7-10-12)-10-12|4 
CoolRunner-II Family — 1.8 Volt 

17 
T 
v7 
6000 | 256) 40 |1.5/1.8/2.5/3.3] 1.5/1.8/2.5/3.3/184/ 2 | 5 | 5-6-7 | 6-7 |3| 17 
|u| yananaaa|isnanaeaze| 6 [4-10 7-1 2 7 


12000 | 512] 40 |1.5/1.8/2.5/3.3) 1.5/1.8/2.5/3.3] 270] 4 
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PLCC Packages (PC) 
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PQFP Packages (PQ) 
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Chip Scale Packages (CP) — wire-bond chip-scale BGA (0.5 mm ball spacing) 


56 BIE EEREE BQ ORE 
Chip Scale Packages (CS) — wire-bond chip-scale BGA (0.8 mm ball spacing) 

48 sn et | | 
144 (a7 mi | ros) | Bees 
280 P| so} | to] tea] Parcel 


BGA Packages (BG) — wire-bond standard BGA (1.27 mm ball spacing) 


256 mei fanz fa 


FBGA Packages (FG) — wire-bond Fineline BGA (1.0 mm ball spacing) 
= A I 
220 260 240 | 270 
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Important: Verify all Data with Device Data Sheet and Product Availability with your local Xilinx Rep 
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Xilinx Software 


Feature ISE WebPACK ISE BaseX ISE Foundation ISE Alliance 


Pc PCIUNIX PCIUNIX 
Up to 300K (Virtex E and Virtex II) Up to 300K ALL ALL 


Spartan-Il Series ALL ALL ALL 


N AKE & Newer AKE & Newer AKE & Newer 


ALL ALL ALL ALL 
ALL ALL ALL ALL 


No Sold as an Option Sold as an Option Sold as an Option 


Yes Yes Yes Yes 




















Platform 
Virtex Series 
Spartan Series 
XC4000 Series 
CoolRunner Series 
XC9500 Series 
Modular Design 





| 





Educational Services 






Design Services ves Yes Yes Yes 
Support Services Yes Yes Yes 
Schematic Editor (Gate & Block Level HDL) Yes Yes No 
HDL Editor Yes Yes Yes Yes 
State Diagram Editor Yes Yes No 
CORE Generator No Yes Yes Yes 


N Sold as an Option Sold as an Option Sold as an Option 
Yes PC Only No 


Yes Yes Yes No 


No Yes PC Only PC Only 


Yes Yes Yes Yes 


CPLD CPLD CPLD (PC Only) No 


Xilinx System Generator for Simulink 
FPGA Express 
Xilinx Synthesis Technology (XST) 


= 
jo) Ss 













Synplify/Pro Integration 


Leonardo Integration 


iMPACT Yes Yes Yes Yes 
FloorPlanner Yes Yes Yes Yes 
Xilinx Constraints Editor Yes Yes Yes Yes 
Timing Driven Place & Route Yes Yes Yes Yes 










Timing Improvement Wizard No tes Yes Yes 


IBIS Models Yes Yes Yes Yes 
STAMP Models Yes Yes Yes Yes 
LMG SmartModels Yes Yes Yes Yes 
HDL Bencher Yes Yes No 
Model Sim Xilinx Edition (XE) ModelSim XE Starter Included ModelSim XE Starter Included ModelSim XE Starter Included ModelSim XE Starter Included 


(See www.xilinx.com for more information) (ModelSim XE Sold as an Option) (ModelSim XE Sold as an Option) (Model Sim XE Sold as an Option) (Model Sim XE Sold as an Option) 


Static Timing Analyzer Yes Yes Yes Yes 


Chipscope ILA No Sold as an Option Sold as an Option Sold as an Option 


FPGA Editor with Probe No Yes Yes Yes 
ChipViewer Yes Yes Yes Yes 


XPower (Power Analysis) Yes Yes Yes Yes 
3rd Party Simulator Support Yes Yes Yes Yes 


For more information on the complete list of Xilinx IP products, visit the Xilinx IP Center at http://www.xilinx.com/ipcenter 
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Configuration 





Xilinx Contiguration Storage Solutions 











Non-Volatile Media 


oD) 
(2) 
CG 
fom 
(o) 
d= 
NY 
(D) 
— 
= 
~~ 
Ha 
(o) 
Y 


= 
STAG Yes | Yes | Yes 


Min board space 


aD 
= 
e Compression 





Removable 





co) 
a) 
° 
= 
f=9) 
— 
= 
° 
U 
<x 
7) 
a. 
LL. 


System ACE CF up to 8 Gbit 


OFF Max Config. Speed 


H Memory Density 
aN Number of Components 





Us: 30 Mbit/sec CompactFlash 
System ACE MPM 16 Mbit | 1 | 12.25 cm’] Yes SelectIMAP (up to 4 FPGA) Up to 8 No | No | Yes | 152 Mbit/sec | AMD Flash Memory 
32 Mbit Slave-Serial (up to 8 FPGA chains) 
64 Mbit 
System ACE SC 16 Mbit | 3} Custom | Yes SelectMAP (up to 4 FPGA) Up to 8 No | No | Yes | 152 Mbit/sec | AMD Flash memory 
32 Mbit Slave-Serial (up to 8 FPGA chains) 
64 Mbit 


S OTP Configuration PROMs for Spartan-lI/IIE 
~3 excassoe [Yviy | || 33viy[y 
SIRSEE | xcassooe fy |y{y| | | |3av|y | 
measisoe fy iy) | | ¥|aav[y | y 
casooe fy fy) | |r savvy 
case) |) | | lr iaav|y | y 


| xcs [y{yiy{ | | |aavly|y 
vo | xcsso [ytyiy{ | | |saviy|y 
pa | Le fy faa] | Des c2s500 a Ea 
One-Time Programmable (OTP) Configuration PROMs PS esioon ey apa eelealecuna i 
smb | [y|y}y{ | [saviy [vy _ xcasiso fy {yty| | | [saviy |i 
ey . 


ee =< 
ee 
a 


ei Meleoe ov Weevonlny 


Seve tenes 





Xilinx Home Page Xilinx Education Center 

http://www.xilinx.com http://www.xilinx.com/support/education-home.htm 

Xilinx Online Support Xilinx Tutorial Center 
http://www.xilinx.com/support/support.htm http://www.xilinx.com/support/techsup/tutorials/index.htm 
Xilinx IP Center Xilinx WebPACK 

http://www.xilinx.com/ipcenter/index.htm http://www.xilinx.com/sxpresso/webpack.htm 
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Xilinx Global Services 


Xilinx Global Services 


Xilinx Design Services Part Number Product Description Duration | Availability 





Educational Services 


e Provide extensive FPGA hardware and 


embedded software design experience FPGA13000-4-ILT Fundamentals of FPGA Design (v4.x) 8 Now 
backed by industry recognized experts and FPGA16000-4-ILT ISE Design Entry (v4.x) — Instructor-Led course hee Now 
resources to solve even the most complex 
design challenge. FPGA23000-4-ILT Designing for Performance (v4.x) Now 
‘ System Architecture Consulting Be Pravide LANG1100-3-ILT VHDL — Introduction to VHDL (v3.x) — Instructor-Led Course Now 
engineering services to define system architecture LANG11000-4-ILT VHDL — Introduction to VHDL (v4.x) — Instructor-Led Course November 
and partitioning for design specification. 
LANG12000-4-ILT VERILOG — Introduction to Verilog (v4.x) — Instructor-Led Course Now 
: a= : : 
Custom Design Solutions — Project designed, PCI1800-3-ILT PCI CORE Basics (v3.x) — Instructor-Led Course 8 | Now 
verified, and delivered to mutually agreed 
upon design specifications. PCI18000-4-ILT PCI CORE Basics (v4.x) — Instructor-Led Course ea December 
e IP Core Development, Optimization, Integration, PCI2800-3-ILT PCI — Designing a PCI System (v3.x) — Instructor-Led Course Now 
Modification, and Verification —3 Modify, PC128000-4-ILT PCI — Designing a PCI System (v4.x) — Instructor-Led Course December 
integrate, and optimize customer intellectual — 
property or chiral party cores to wotlk nek PC|2900-3-ILT PCI-X — Designing for PCI-X (v3.x) Now 
Xilinx technology. Develop customer- PCI129000-4-ILT PCI-X — Designing for PCI-X (v4.x) December 
required special features to Xilinx We eee ASIC1500-3-|LT FPGA Design for the ASIC User (v3.x) — Instructor-Led Course Now 
or third party cores. Perform integration, 
optimization, and verification of IP cores ASIC25000-4-ILT FPGA Design for the ASIC User (v4.x) — Instructor-Led Course November 
in Xilinx technology. DSP2000-3-ILT DSP Implementation Techniques for Xilinx FPGAs November 
° Embedded Software — Develop complex PROMO-5002-4-LEL E-Series Il (v4) October 


embedded software with real-time constraints, = 
using hardware/software co-design tech- TE TCR One Training Credit ey 
niques. TC-0S-1DY One Day of On-Site Training = Now 
¢ Conversions — Convert ASIC designs and TC-OS-2DY Two days of On-Site Training Now 
other FPGAs to Xilinx technology and TC-OS-3DY Three days of On-Site Training Now 
devices. 
TC-OS-4DY Four days of On-Site Training Now 


aan : : . TC-OS-5DY Five days of On-Site Training 40 Now 
Xilinx Platinum Technical Services 
Platinum Technical Services 


e Senior Applications Engineers SC-PLAT-SVC-10 1 Seat Platinum Technical Service w/10 education credits Now 


¢ Dedicated Toll Free Number* SC-PLAT-SITE-50 Platinum Technical Service site license up to 50 customers Naeeeeral Now 
¢ Priority Case Resolution SC-PLAT-SITE-100 Platinum Technical Service site license for 51-100 customers iy Now 





e Proactive Status Updates SC-PLAT-SITE-150 Platinum Technical Service site license for 101-150 customers Now 
¢ Ten Education Credits Design Services 
e Electronic Newsletter DC-DES-SERV | Design Services Contract | | 


¢ Formal Escalation Process 
¢ Service Packs and Software Updates 


® Application Engineer to Customer Ratio, 


2x Gold Level 


* Available in US only 


Call Education Services Call Design Services Call Technical Services 

North America: 877-XLX-CLASS (877-959-2527) North America & Asia: North America: 800-255-7778 or 408-879-5199 
http://support.xilinx.com/support/training/training.htm Richard Foder: 408-626-4256 http://support.xilinx.com 

Europe: +44-870-7350-548 Europe: United Kingdom: +44-870-7350-610 
eurotraining@xilinx.com Alex Hillier: +44-870-7350-516 eurosupport@xilinx.com 

Japan: +81-03-5321-7750 | Martina Finnerty: +353-1-4032469 Japan: +81-03-5321-7750 
http://support.xilinx.co.jp/support/education-home.htm jhotline@xilinx.com 

Asia Pacific: http://support.xilinx.com/support/education-home.htm Other Locations: 


http://support.xilinx.com/support/services/contact_info.htm 
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Second Generation 

CD Eval 
Kit Available No 


www.xilinx.com/dsp 


















Kal 


Lape roaladnret aaPippacs SELLE 


Product Evaluation 


all pre tae | 


=e 





The kit includes: 

e Simulink® System Modeling Tool from The MathWorks 

e System Generator for DSP and ISE 4.11 Development Software from Xilinx® 
e FPGA Advantage® from Mentor Graphics® 

e SynplityPro” from Synplicity® 


>< XILINX’ 


The Programmable Logic Company™ 


FORTUNE 2002 
100 BEST COMPANIES TO WORK FOR 


©2002 Xilinx, Inc. Xilinx is a registered trademark, Virtex and Spartan are trademarks, and the Programmable Logic Company is a service mark of Xilinx, Inc. All other trademarks and registered trademarks are the property of their respective owners. 





Xilinx, IBM, Wind River Systems, 
CMP (publisher of EETimes), and other industry 

leaders invite you to Programmable World 2002. 

It’s an unprecedented event featuring visionary presentations 
and hands-on technical training sessions from industry 
leaders, delivering all you need to know about the exciting 
new era of systems design. 


Why You Should Attend 
The Forum will highlight new technology poised to revolutionize 
the way electronic systems are designed. 


You will learn how to: 

¢ Implement extreme processing using the industry's fastest 
FPGA fabric and 300 MHz PowerPC processors 

¢ Design 40+ Gbps applications with next-generation 
3.125 Gbps serial |/O technology 

¢ Implement programmable TeraM AC/second DSP systems 


¢ Optimize system performance through 
dynamic hardware and software (re-) partitioning 
¢ Debug hardware and software in parallel, in your systemn— 
at full speed 
¢ Maximize system performance while reducing total 
system cost 


Who Should Attend 

This industry event is a must for product develooment managers, 
system architects, hardware and software engineers, embedded 
and DSP engineers, as well as executives, industry analysts, 
industry press and academia. 


Simulcast LIVE Across North America and Europe 
Programmable World 2002 will be simulcast live to over 50 
cities across North America and Europe including Boston, 
Dallas, Los Angeles, Ottawa, Raleigh, San Jose, and Paris. 
Register today to reserve your seat in the city nearest you! 


Seating Is limited. Register now at www.xilinx.com/pw2002 or visit us In Booth #4644 


>= XILINX’ 


The Programmable Logic Company™ 


@ 


Electronics 
Group 


AWindRwer' CMP 


United Business Media 


© 2002, Xilinx Inc. All rights reserved. The Xilinx name and the Xilinx logo are registered trademarks. The Programmable Logic Company is a service mark of Xilinx Inc. All other trademarks and registered trademarks are the property of their respective owners. 
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